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pplication of: 



THE UNITED STATES PATENT AND TRADEMARK OFFICE 



PATENT 



THOMAS J. CARLSON ET AL.. 



CASE NO.: BB1321-1 



APPLICATION NO.: 09/913064 



GROUP ART UNIT: UNKNOWN 



FILED: AUGUST 8, 2001 



LA. APPLICATION NO.: PCT/US00/03453 



EXAMINER: UNKNOWN 



LA. FILING DATE: FEBRUARY 9, 2000 



CONFIRMATION NO.: 3445 



FOR: UDP-GLUCOSE MODIFIERS 



STATEMENT UNDER 37 CFR 1.821(g> and l.S25(V\ 



Commissioner of Patents and Trademarks 
Washington, DC 20231 

Sir: 

The submission of the substitute Sequence Listing filed concurrently herewith does not 
include new matter. 

The copy of the substitute Sequence Listing in computer readable form filed 
concurrently herewith is the same as the paper copy of the substitute Sequence Listing filed 
concurrently herewith. 



Respectfully submitted, 




Paul D. Golian 



Attorney For Applicants 
Registration No. 42,591 
Telephone: 302-992-3749 
Facsimile: 302-892-1026 



Dated: 





PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Application of: 
THOMAS J. CARLSON ET AL. 



APPLICATION NO.: 09/913,064 

I. A. APPLICATION NO.: PCT/US00/03453 

LA. FILING DATE: FEBRUARY 9, 2000 

CONFIRMATION NO.: 3445 

FOR: UDP-GLUCOSE MODIFIERS 



CASE NO.: BB-1321-1 

GROUP ART UNIT: UNKNOWN 

EXAMINER: UNKNOWN '.! 



PRELIMINARY AMENDMENT AND RESPONSE TO NOTIFICATION OF 

DEFECTIVE RESPONSE 



Commissioner for Patents 
Box PCT 

Washington, D.C. 20231 
Sir: 

In response to the Notification of Defective Response mailed March 13, 2002 
amend the application as follows and consider the following remarks. 

IN THE SEQUENCE LISTING: 

Please replace the originally filed sequence listing with the enclosed substitu; 
sequence listing. 

REMARKS 

No new matter is believed to be added by the enclosed substitute sequence 1 
Please charge any fees necessitated by this response to Deposit Account 04-1928 / 
E. I. du Pont de Nemours and Company). I 

In view of the foregoing, allowance of the above-referenced application is ~ 
solicited. 

Respectfully submitted, 





PAUL D. GOLIAN 
ATTORNEY FOR APPLICANTS 
REGISTRATION NO. 42, 591 
TELEPHONE: 302-992-3749 
FACSIMILE: 302-892-1026 



Dated: 



°2 



Rev. 10/93 




In the Application of: 



APPLICATION NO.: UNKNOWN 



O. FAMODU ET AL. 



CASE NO.: BB1321-1 

V 

GROUP ART UNIT: UNKNOWN 



FILED: CONCURRENTLY HEREWITH 



EXAMINER: UNKOWN 



FOR: UDP-GLUCOSE MODIFIERS 



PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 
Washington, DC 20231 



Sir: 



Before examination of the above-referenced application, please amend the application 
as follows: 

IN THE SPECIFICATION : 

Please replace the following paragraphs: 
Paragraph starting at page 6, line 15: 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less than 
the entire coding region of a gene, and by nuclbic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid 
fragment which result in the production of a chemically equivalent amino acid at a given site, 
but do not affect the functional properties of the encoded polypeptide, are well known in the 
art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be substituted 
by a codon encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result in 
substitution of one negatively charged residue for another, such as aspartic acid for glutamic 
acid, or one positively charged residue for another, such as lysine for arginine, can also be 
expected to produce a functionally equivalent product. Nucleotide changes which result in 
alteration of the N-terminal and C -terminal portions of the polypeptide molecule would also 
not be expected to alter the activity of the polypeptide. Each of the proposed modifications is 
well within the routine skill in the art, as is determination of retention of biological activity of 
the encoded products. Consequently, an isolated polynucleotide comprising a nucleotide 
sequence of at least one of 30 (preferably at least one of 40, most preferably at least one of 
60) contiguous nucleotides derived from a nucleotide sequence selected from the group 



consisting of (a) SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 25, 27, 29, 31, 33, 35, 37, 39, 41, 
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43, 45, 47, 49, and 51 and the complement of such nucleotide sequences, and/or (b) SEQ ID 
NOs:19, 21, 23, 53, 55, and 57 and the complement of such nucleotide sequences may be 
used in methods of selecting an isolated polynucleotide that affects the expression of an 
LPAAT isozyme polypeptide in a host cell. A method of selecting an isolated polynucleotide 
that affects the level of expression of a polypeptide in a host cell (eukaryotic, such as plant or 
yeast, prokaryotic such as bacterial, or viral) may comprise the steps of: constructing an 
isolated polynucleotide of the present invention or an isolated chimeric gene of the present 
invention; introducing the isolated polynucleotide or the isolated chimeric gene into a host 
cell; measuring the level a polypeptide in the host cell containing the isolated polynucleotide; 
and comparing the level of a polypeptide in the host cell containing the isolated 
polynucleotide with the level of a polypeptide in a host cell that does not contain the isolated 
polynucleotide. 

Paragraph starting at page 8, line 28: 

"Codon degeneracy" refers to divergence in the genetic code permitting variation of the 
nucleotide sequence without affecting the amino acid sequence of an encoded polypeptide. 
Accordingly, the instant invention relates to any nucleic acid fragment comprising a 
nucleotide sequence that encodes all or a substantial portion of the amino acid sequences set 
forth herein. The skilled artisan is well aware of the "codon-bias" exhibited by a specific host 
cell in usage of nucleotide codons to specify a given amino acid. Therefore, when 
synthesizing a nucleic acid fragment for improved expression in a host cell, it is desirable to 
design the nucleic acid fragment such that its frequency of codon usage approaches the 
frequency of preferred codon usage of the host cell. 

In the Claims: 

Please cancel claims 1-22. 

Please add the following new claims: 

--23. An isolated polynucleotide that encodes a UDP-galactose 4-epimerase polypeptide 
having a sequence identity of at least 95% based on the Clustal method of alignment 
when compared to a polypeptide selected, from the group consisting of SEQ ID NOs: 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22; and 24. 

24. The polynucleotide of Claim 23 wherein the polynucleotide encodes a polypeptide 
selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, and 24. 

25. An isolated complement of the polynucleotide of Claim 23, wherein (a) the 
complement and the polynucleotide consist of the same number of nucleotides, and (b) 
the nucleotide sequences of the complement and the polynucleotide have 100% 
complementarity. 
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26. An isolated nucleic acid molecule that encodes a UDP-galactose 4-epimerase 
polypeptide and remains hybridized with the isolated polynucleotide of Claim 23 
under a wash condition of 0.1X SSC, 0.1% SDS, and 65°C. 

27. A cell or a virus comprising the polynucleotide of Claim 23. 

28. The cell of Claim 27, wherein the cell is selected from the group consisting of a yeast 
cell, a bacterial cell, an insect cell, and a plant cell. 

29. A transgenic plant comprising the polynucleotide of Claim 23. 

30. A method for transforming a cell comprising introducing into a cell the polynucleotide 
of Claim 23. 

31. A method for producing a transgenic plant comprising (a) transforming a plant cell 
with the polynucleotide of Claim 23, and (b) regenerating a plant from the transformed 
plant cell. 

32. An isolated a UDP-galactose 4-epimerase polypeptide having a sequence identity of at 
least 80% based on the Clustal method compared to an amino acid sequence selected 
from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 
24. 

33. The isolated polypeptide of Claim 32 wherein the polypeptide has a sequence selected 
from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 

24. 

34. A chimeric gene comprising the polynucleotide of Claim 23 operably linked to at least 
one suitable regulatory sequence. 

35. The chimeric gene of Claim 34, wherein the chimeric gene is an expression vector. 

36. A method for altering the level of a UDP-galactose 4-epimerase polypeptide 
expression in a host cell, the method comprising: 

(a) Transforming a host cell with the chimeric gene of claim 34; and 

(b) Growing the transformed cell in step (a) under conditions suitable for the 
expression of the chimeric gene. — 
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Remarks 



Applicants respectfully submit that the amendment to the Specification only corrects 
obvious typographical errors. Furthermore, applicants submit that newly added claims more 
clearly and distinct recite that which applicants consider to be their invention, and are 
adequately supported by the original disclosure. 

No new matter is believed to be at issue. Entry of the amendments and early favorable 
consideration of the claims on the merits are hereby respectfully requested. 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

In showing the changes, deleted material is shown as bolded brackets and stricken 
through, and inserted material is shown underlined. 

IN THE SPECIFICATION: 

Please replace the following paragraphs: 
Paragraph starting at page 6, line 15: 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less than 
the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid 
fragment which result in the production of a chemically equivalent amino acid at a given site, 
but do not affect [ effect ] the functional properties of the encoded polypeptide, are well known 
in the art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be 
substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result in 
substitution of one negatively charged residue for another, such as aspartic acid for glutamic 
acid, or one positively charged residue for another, such as lysine for arginine, can also be 
expected to produce a functionally equivalent product. Nucleotide changes which result in 
alteration of the N-terminal and C -terminal portions of the polypeptide molecule would also 
not be expected to alter the activity of the polypeptide. Each of the proposed modifications is 
well within the routine skill in the art, as is determination of retention of biological activity of 
the encoded products. Consequently, an isolated polynucleotide comprising a nucleotide 
sequence of at least one of [60] 30 (preferably at least one of 40, most preferably at least one 
of [30] 60 ) contiguous nucleotides derived from a nucleotide sequence selected from the 
group consisting of (a) SEQ ID NOs:l, 3, 5, 7, 9, 11, 13, 15, 17, 25, 27, 29, 31, 33, 35, 37, 39, 
41 9 43 5 45 ? 47 ? 49 ? and 51 and the complement of such nucleotide sequences, and/or (b) SEQ 
ID NOs:19, 21, 23, 53, 55, and 57 and the complement of such nucleotide sequences may be 
used in methods of selecting an isolated polynucleotide that affects the expression of an 
LPAAT isozyme polypeptide in a host cell. A method of selecting an isolated polynucleotide 
that affects the level of expression of a polypeptide in a host cell (eukaryotic, such as plant or 
yeast, prokaryotic such as bacterial, or viral) may comprise the steps of: constructing an 
isolated polynucleotide of the present invention' or an isolated chimeric gene of the present 
invention; introducing the isolated polynucleotide or the isolated chimeric gene into a host 
cell; measuring the level a polypeptide in the host cell containing the isolated polynucleotide; 
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and comparing the level of a polypeptide in the host cell containing the isolated 
polynucleotide with the level of a polypeptide in a host cell that does not contain the isolated 
polynucleotide. 

Paragraph starting at page 8, line 28: ? 

"Codon degeneracy" refers to divergence in the genetic code permitting variation of the 
nucleotide sequence without affecting [e ff e cting ] the amino acid sequence of an encoded 
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment 
comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid 
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited by 
a specific host cell in usage of nucleotide codons to specify a given amino acid. Therefore, 
when synthesizing a nucleic acid fragment for improved expression in a host cell, it is 
desirable to design the nucleic acid fragment such that its frequency of codon usage of the 
host cell. 
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This application claims the benefit of U.S. Provisional Application No. 60/1 19,588, 
filed February 10, 1999. 



This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding UDP-glucose modifiers in plants and 
seeds. 



Raffinose saccharides are a group of D-galactose-containing oligosaccharides of 
sucrose that are widely distributed in plants. Raffinose saccharides are characterized by 
having the general formula: [0-a-D-galactopyranosyl-(l->6) n -a-glucopyranosyl- 
(l-^2)-P-D-fructofuranoside where n=0 through n=4 are known respectively as sucrose, 
raffinose, stachyose, verbascose, and ajugose. The biosynthesis of raffinose saccharides has 
been fairly well characterized [see Dey, P. M. In Biochemistry of Storage Carbohydrates in 
Green Plants (1985)]. The committed reaction of raffinose saccharide biosynthesis involves 
the synthesis of galactinol (O-a-D-galactopyranosyl-Cl-^l)-/^^- inositol) from 
UDP-galactose and rayo-inositol. The enzyme that catalyzes this reaction is galactinol 
synthase. The flux of carbon through this reaction is controlled by the concentrations of the 
two substrates for the enzyme. Thus, while they are not unique to the raffinosaccharide 
pathway, the enzymes which produce these substrates serve to limit carbon flux to the 
raffino saccharides . 

UDP-glucose 4-epimerase (EC 5.1.3.2) is also called UDP-galactose 4-epimerase. It is 
responsible for the interconversion of UDG-glucose and UDP-galactose. UDP-galactose is a 
precursor of galactolipids and cell wall polysaccharides. When transgenic Arabidopsis 
plants expressing the UDP-glucose 4-epimerase gene in sense or antisense orientation are 
grown in soil, no changes in morphology or relative amounts of different galactose- 
containing compounds are detected. When the plants are grown on agar plates in the 
presence of galactose, a decrease in enzyme activity and an increase in the UDP-galactose 
content correlates with a repression of growth while the UDP-glucose content does not 
change. Changes in the amount of galactose in the cell wall is detected in plants with low 
UDP-Glucose epimerase activity grown on galactose, while there is no change in the 
cellulose content of the leaves (Dormann and Benning (1998) Plant J. 73:641-652). 

The activity of UDP-glucose 4-epimerase appears to be particularly limiting to carbon 
flux into the raffinosaccharide pathway, therefore further reduction of the activity of this 
enzyme by tissue- and temporally-specific gene silencing should greatly decrease the levels 
of raffinose and stachyose in seeds. 



FIELD OF THE INVENTION 



BACKGROUND OF THE INVENTION 
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Changes in the expression of either UDP-glucose 4-epimerase will allow the 
modification of the carbohydrate metabolism in transgenic plants. Modification of the 
expression of UDP-glucose 4-epimerase may result in grains with reduced cell-wall 
constituents (fiber) and increased levels of starch. This trait will add value for feed, food, 
5 and industrial applications of the crops. 

SUMMARY OF THE INVENTION 
Changes in the expression of UDP-glucose 4-epimerase will allow the modification of 
the carbohydrate metabolism in transgenic plants. Modification of the expression of 
UDP-glucose 4-epimerase may result in grains with reduced cell- wall constituents (fiber) 
10 and increased levels of starch. This trait will add value for feed, food, and industrial 

applications of the crops. For example, overexpression of UDP-glucose 4-epimerase in 
soybean should yield crops with lower contents of raffinose and stachyose and with 
significantly higher contents of sucrose. 

The present invention relates to isolated polynucleotides comprising a nucleotide 
15 sequence encoding a polypeptide of at least 90 amino acids that has at least 95% identity 
based on the Clustal method of alignment when compared to a UDP-galactose 4-epimerase 
polypeptide selected from the group consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, and 24. The present invention also relates to an isolated polynucleotide comprising 
the complement of the nucleotide sequences described above. 
20 It is preferred that the isolated polynucleotide of the claimed invention consists of a 

nucleic acid sequence selected from the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 
13, 15, 1 7, 19, 21, and 23 that codes for the polypeptide selected from the group consisting 
of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24. The present invention also 
relates to an isolated polynucleotide comprising a nucleotide sequences of at least one of 60 
25 (preferably at least one of 40, most preferably at least one of 30) contiguous nucleotides 

derived from a nucleotide sequence selected from the group consisting of SEQ ID NOs:l, 3, 
5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23 and the complement of such nucleotide sequences. 

The present invention relates to a chimeric gene comprising an isolated polynucleotide 
of the present invention operably linked to suitable regulatory sequences. 
30 The present invention relates to an isolated host cell comprising a chimeric gene of the 

present invention or an isolated polynucleotide of the present invention. The host cell may 
be eukaryotic, such as a yeast or a plant cell, or prokaryotic, such as a bacterial cell. The 
present invention also relates to a virus, preferably a baculovirus, comprising an isolated 
polynucleotide of the present invention or a chimeric gene of the present invention. 
35 The present invention relates to a process for producing an isolated host cell 

comprising a chimeric gene of the present invention or an isolated polynucleotide of the 
present invention, the process comprising either transforming or transfecting an isolated 
compatible host cell with a chimeric gene or isolated polynucleotide of the present invention. 

2 
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The present invention relates to a UDP-galactose 4-epimerase polypeptide of at least 
90 amino acids comprising at least 95% homology based on the Clustal method of alignment 
compared to a polypeptide selected from the group consisting of SEQ ID NOs:2, 4, 6, 8, 10, 
12, 14, 16, 18, 20, 22, and 24. 
5 The present invention relates to a method of selecting an isolated polynucleotide that 

affects the level of expression of a UDP-galactose 4-epimerase polypeptide in a host cell, 
preferably a plant cell, the method comprising the steps of: (a) constructing an isolated 
polynucleotide of the present invention or an isolated chimeric gene of the present invention;, 
(b) introducing the isolated polynucleotide or the isolated chimeric gene into a host cell; 
10 (c) measuring the level a UDP-galactose 4-epimerase polypeptide in the host cell containing 
the isolated polynucleotide; and (d) comparing the level of a UDP-galactose 4-epimerase 
polypeptide in the host cell containing the isolated polynucleotide with the level of a 
UDP-galactose 4-epimerase polypeptide in the host cell that does not contain the isolated 
polynucleotide. 

15 The present invention relates to a method of obtaining a nucleic acid fragment 

encoding a substantial portion of a UDP-galactose 4-epimerase polypeptide, preferably a 
plant UDP-galactose 4-epimerase polypeptide, comprising the steps of: synthesizing an 
oligonucleotide primer comprising a nucleotide sequence of at least one of 60 (preferably at 
least one of 40, most preferably at least one of 30) contiguous nucleotides derived from a 

20 nucleotide sequence selected from the group consisting of SEQ ID NOs: 1 , 3, 5, 7, 9, 1 1 , 1 3, 
15, 17, 19, 21, 23 and the complement of such nucleotide sequences; and amplifying a 
nucleic acid fragment (preferably a cDNA inserted in a cloning vector) using the 
oligonucleotide primer. The amplified nucleic acid fragment preferably will encode a 
portion of a UDP-galactose 4-epimerase amino acid sequence. 

25 The present invention also relates to a method of obtaining a nucleic acid fragment 

encoding all or a substantial portion of the amino acid sequence encoding a UDP-galactose 
4-epimerase polypeptide comprising the steps of: probing a cDNA or genomic library with 
an isolated polynucleotide of the present invention; identifying a DNA clone that hybridizes 
with an isolated polynucleotide of the present invention; isolating the identified DNA clone; 

30 and sequencing the cDNA or genomic fragment that comprises the isolated DNA clone. 

The present invention relates to a composition, such as a hybridization mixture, 
comprising an isolated polynucleotide of the present invention. 

The present invention relates to an isolated polynucleotide of the present invention 
comprising at least one of 30 contiguous nucleotides derived from a nucleic acid sequence 

35 selected from the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 
and the complement of such sequences. 

The present invention relates to an expression cassette comprising an isolated 
polynucleotide of the present invention operably linked to a promoter. 

3 
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The present invention relates to a method for positive selection of a transformed cell 
comprising; (a) transforming a host cell with the chimeric gene of the present invention or 
an expression cassette of the present invention; and (b) growing the transformed host cell, 
preferably plant cell, such as a monocot or a dicot, under conditions which allow expression 
5 of the UDP-galactose 4-epimerase polynucleotide in an amount sufficient to complement a 
null mutant to provide a positive selection means. 

RKTEP DESCRIPTION OF THE DRAWING 
AND SEQUENCE DESCRIPTIONS 
The invention can be more fully understood from the following detailed description 
10 and the accompanying drawing and Sequence Listing which form a part of this application. 
Figure 2 shows a comparison of the amino acid sequences of the UDP-glucose 
4-epimerase from soybean clone sls2c.pk017.k22:fis (SEQ ID NO:14), wheat clone 
wdk5c.pk006.o4:fis (SEQ ID NO:16), corn clone cen3n.pk0155.b8:fis (SEQ ID NO:18) ? rice 
clone rlr2.pk0043.c3:fis (SEQ ID NO:20), soybean clone se6.pk0014.fl2 (SEQ ID NO:22), 
15 Pisum sativum (NCBI General Identifier No. 1 173555, SEQ ID NO:25) and Cyamopsis 
tetragonoloba (NCBIGeneral Identifier No. 3021357, SEQ ID NO:26). Amino acids 
conserved among all sequences are indicated by an asterisk (*) above the alignment. Dashes 
are used by the program to maximize the alignment. 

Table 1 lists the polypeptides that are described herein, the designation of the cDNA 
20 clones that comprise the nucleic acid fragments encoding polypeptides representing all or a 
substantial portion of these polypeptides, and the corresponding identifier (SEQ ID NO:) as 
used in the attached Sequence Listing. The sequence descriptions and Sequence Listing 
attached hereto comply with the rules governing nucleotide and/or amino acid sequence 
disclosures in patent applications as set forth in 37 C.F.R. §1.821-1.825. 

25 

TABLE 1 

UDP-Galactose 4-Epimerase 

SEQ ID NO: 

Protein Clone Designation (Nucleotide) (Amino Acid) 



Com UDP-Galactose 4-Epimerase 


cen3n.pk0155.b8 


1 


2 


Rice UDP-Galactose 4-Epimerase 


rlr2.pk0043.c3 


3 


4 


Soybean UDP-Galactose 


sls2c.pk017.k22 


5 


6 


4-Epimerase 








Wheat UDP-Galactose 


wdk5c.pk006.o4 


7 


8 


4-Epimerase 








Corn UDP-Galactose 4-Epimerase 


p0083.clddm72r 


9 


10 


Rice UDP-Galactose 4-Epimerase 


rls24.pk0008.dl2 


11 


12 


Soybean UDP-Galactose 


sls2c.pk017.k22:fis 


13 


14 



4-Epimerase 
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SEQ ID NO: 

Protein Clone Designation (Nucleotide) (Amino Acid) 



Wheat UDP-Galactose 


wdk5c.pk006.o4:fis 


15 


16 


4-Epimerase 








Com UDP-Galactose 4-Epimerase 


cen3n.pk0 1 5 5.b8 :fis 


17 


18 


Rice UDP-Galactose 4-Epimerase 


rlr2.pk0043.c3:fis 


19 


20 


Soybean UDP-Galactose 


se6.pk0014.fl2 


21 


22 


4-Epimerase 








Wheat UDP-Galactose 


wlm0.pk0015.g3 


23 


24 


4-Epimerase 









The Sequence Listing contains the one letter code for nucleotide sequence characters 
and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB 
5 standards described in Nucleic Acids Res. 75:3021-3030 (1985) and in the Biochemical J. 
219 (No. 2^:345-373 (1984) which are herein incorporated by reference. The symbols and 
format used for nucleotide and amino acid sequence data comply with the rules set forth in 
37C.F.R. §1.822. 

DETAILED DESCRIPTION OF THE INVENTION 
10 In the context of this disclosure, a number of terms shall be utilized. As used herein, a 

"polynucleotide" is a nucleotide sequence such as a nucleic acid fragment. A polynucleotide 
may be a polymer of RNA or DNA that is single- or double-stranded, that optionally 
contains synthetic, non-natural or altered nucleotide bases. A polynucleotide in the form of 
a polymer of DNA may be comprised of one or more segments of cDNA, genomic DNA, 
15 synthetic DNA, or mixtures thereof. An isolated polynucleotide of the present invention 
may include at least one of 60 contiguous nucleotides, preferably at least one of 40 
contiguous nucleotides, most preferably one of at least 30 contiguous nucleotides derived 
from SEQ IDNOs:l,3, 5,7, 9, 11, 13, 15, 17, 19, 21, 23, or the complement of such 
sequences. 

20 As used herein, "substantially similar" refers to nucleic acid fragments wherein 

changes in one or more nucleotide bases results in substitution of one or more amino acids, 
but do not affect the functional properties of the polypeptide encoded by the nucleotide 
sequence. "Substantially similar" also refers to nucleic acid fragments wherein changes in 
one or more nucleotide bases does not affect the ability of the nucleic acid fragment to 

25 mediate alteration of gene expression by gene silencing through for example antisense or co- 
suppression technology. "Substantially similar" also refers to modifications of the nucleic 
acid fragments of the instant invention such as deletion or insertion of one or more 
nucleotides that do not substantially affect the functional properties of the resulting 
transcript vis-a-vis the ability to mediate gene silencing or alteration of the functional 
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properties of the resulting protein molecule. It is therefore understood that the invention 
encompasses more than the specific exemplary nucleotide or amino acid sequences and 
includes functional equivalents thereof. 

Substantially similar nucleic acid fragments may be selected by screening nucleic acid 
5 fragments representing subfragments or modifications of the nucleic acid fragments of the 
instant invention, wherein one or more nucleotides are substituted, deleted and/or inserted, 
for their ability to affect the level of the polypeptide encoded by the unmodified nucleic acid 
fragment in a plant or plant cell. For example, a substantially similar nucleic acid fragment 
representing at least one of 30 contiguous nucleotides derived from the instant nucleic acid 
10 fragment can be constructed and introduced into a plant or plant cell. The level of the 

polypeptide encoded by the unmodified nucleic acid fragment present in a plant or plant cell 
exposed to the substantially similar nucleic fragment can then be compared to the level of 
the polypeptide in a plant or plant cell that is not exposed to the substantially similar nucleic 
acid fragment. 

15 For example, it is well known in the art that antisense suppression and co-suppression 

of gene expression may be accomplished using nucleic acid fragments representing less than 
the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid 
fragment which result in the production of a chemically equivalent amino acid at a given 

20 site, but do not effect the functional properties of the encoded polypeptide, are well known in 
the art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be 
substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result 
in substitution of one negatively charged residue for another, such as aspartic acid for 

25 glutamic acid, or one positively charged residue for another, such as lysine for arginine, can 
also be expected to produce a functionally equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the polypeptide molecule 
would also not be expected to alter the activity of the polypeptide. Each of the proposed 
modifications is well within the routine skill in the art, as is determination of retention of 

30 biological activity of the encoded products. Consequently, an isolated polynucleotide 

comprising a nucleotide sequence of at least one of 60 (preferably at least one of 40, most 
preferably at least one of 30) contiguous nucleotides derived from a nucleotide sequence 
selected from the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
and the complement of such nucleotide sequences may be used in methods of selecting an 

35 isolated polynucleotide that affects the expression of a UDP-galactose 4-epimerase 

polypeptide in a host cell. A method of selecting an isolated polynucleotide that affects the 
level of expression of a polypeptide in a host cell (eukaryotic, such as plant or yeast, 
prokaryotic such as bacterial, or viral) may comprise the steps of: constructing an isolated 
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polynucleotide of the present invention or an isolated chimeric gene of the present invention; 
introducing the isolated polynucleotide or the isolated chimeric gene into a host cell; 
measuring the level a polypeptide in the host cell containing the isolated polynucleotide; and 
comparing the level of a polypeptide in the host cell containing the isolated polynucleotide 
5 with the level of a polypeptide in a host cell that does not contain the isolated 
polynucleotide . 

Moreover, substantially similar nucleic acid fragments may also be characterized by 
their ability to hybridize. Estimates of such homology are provided by either DNA-DNA or 
DNA-RNA hybridization under conditions of stringency as is well understood by those 

10 skilled in the art (Hames and Higgins, Eds. (1985) Nucleic Acid Hybridisation, IRL Press, 
Oxford, U.K.). Stringency conditions can be adjusted to screen for moderately similar 
fragments, such as homologous sequences from distantly related organisms, to highly similar 
fragments, such as genes that duplicate functional enzymes from closely related organisms. 
Post-hybridization washes determine stringency conditions. One set of preferred conditions 

15 uses a series of washes starting with 6X SSC, 0.5% SDS at room temperature for 15 min, 
then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, and then repeated twice with 
0.2X SSC, 0.5% SDS at 50°C for 30 min. A more preferred set of stringent conditions uses 
higher temperatures in which the washes are identical to those above except for the 
temperature of the final two 30 min washes in 0.2X SSC, 0.5% SDS was increased to 60°C. 

20 Another preferred set of highly stringent conditions uses two final washes in 0.1X SSC, 
0.1% SDS at 65°C. 

Substantially similar nucleic acid fragments of the instant invention may also be 
characterized by the percent identity of the amino acid sequences that they encode to the 
amino acid sequences disclosed herein, as determined by algorithms commonly employed by 

25 those skilled in this art. Suitable nucleic acid fragments (isolated polynucleotides of the 
present invention) encode polypeptides that are at least about 70% identical, preferably at 
least about 80% identical to the amino acid sequences reported herein. Preferred nucleic 
acid fragments encode amino acid sequences that are about 85% identical to the amino acid 
sequences reported herein. More preferred nucleic acid fragments encode amino acid 

30 sequences that are at least about 90% identical to the amino acid sequences reported herein. 
Most preferred are nucleic acid fragments that encode amino acid sequences that are at least 
about 95% identical to the amino acid sequences reported herein. Suitable nucleic acid 
fragments not only have the above homologies but typically encode a polypeptide having at 
least 50 amino acids, preferably at least 100 amino acids, more preferably at least 150 amino 

35 acids, still more preferably at least 200 amino acids, and most preferably at least 250 amino 
acids. Sequence alignments and percent identity calculations were performed using the 
Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., 
Madison, WI). Multiple alignment of the sequences was performed using the Clustal 
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method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default 
parameters (GAP PEN ALT Y= 10, GAP LENGTH PEN ALT Y= 10). Default parameters for 
pairwise alignments using the Clustal method were KTUPLE 1, GAP PENALTY=3, 
WINDOW-5 and DIAGONALS SAVED=5. 
5 A "substantial portion" of an amino acid or nucleotide sequence comprises an amino 

acid or a nucleotide sequence that is sufficient to afford putative identification of the protein 
or gene that the amino acid or nucleotide sequence comprises. Amino acid and nucleotide 
sequences can be evaluated either manually by one skilled in the art, or by using computer- 
based sequence comparison and identification tools that employ algorithms such as BLAST 

10 (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol 275:403-410; see 
also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence of ten or more contiguous 
amino acids or thirty or more contiguous nucleotides is necessary in order to putatively 
identify a polypeptide or nucleic acid sequence as homologous to a known protein or gene. 
Moreover, with respect to nucleotide sequences, gene-specific oligonucleotide probes 

15 comprising 30 or more contiguous nucleotides may be used in sequence-dependent methods 
of gene identification (e.g., Southern hybridization) and isolation (e.g., in situ hybridization 
of bacterial colonies or bacteriophage plaques). In addition, short oligonucleotides of 12 or 
more nucleotides may be used as amplification primers in PCR in order to obtain a particular 
nucleic acid fragment comprising the primers. Accordingly, a "substantial portion" of a 

20 nucleotide sequence comprises a nucleotide sequence that will afford specific identification 
and/or isolation of a nucleic acid fragment comprising the sequence. The instant 
specification teaches amino acid and nucleotide sequences encoding polypeptides that 
comprise one or more particular plant proteins. The skilled artisan, having the benefit of the 
sequences as reported herein, may now use all or a substantial portion of the disclosed 

25 sequences for purposes known to those skilled in this art. Accordingly, the instant invention 
comprises the complete sequences as reported in the accompanying Sequence Listing, as 
well as substantial portions of those sequences as defined above. 

"Codon degeneracy" refers to divergence in the genetic code permitting variation of 
the nucleotide sequence without effecting the amino acid sequence of an encoded 

30 polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment 

comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid 
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited 
by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, 

35 it is desirable to design the nucleic acid fragment such that its frequency of codon usage 
approaches the frequency of preferred codon usage of the host cell. 

"Synthetic nucleic acid fragments" can be assembled from oligonucleotide building 
blocks that are chemically synthesized using procedures known to those skilled in the art. 
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These building blocks are ligated and annealed to form larger nucleic acid fragments which 
may then be enzymatically assembled to construct the entire desired nucleic acid fragment. 
"Chemically synthesized", as related to nucleic acid fragment, means that the component 
nucleotides were assembled in vitro. Manual chemical synthesis of nucleic acid fragments 
may be accomplished using well established procedures, or automated chemical synthesis 
can be performed using one of a number of commercially available machines. Accordingly, 
the nucleic acid fragments can be tailored for optimal gene expression based on optimization 
of nucleotide sequence to reflect the codon bias of the host cell. The skilled artisan 
appreciates the likelihood of successful gene expression if codon usage is biased towards 
those codons favored by the host. Determination of preferred codons can be based on a 
survey of genes derived from the host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5* non-coding sequences) and following (3' non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
own regulatory sequences. "Chimeric gene" refers any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 
from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 
that is introduced into the host organism by gene transfer. Foreign genes can comprise 
native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a nucleotide sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5* non- 
coding sequences), within, or downstream (3* non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a nucleotide sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3' to a 
promoter sequence. The promoter sequence consists of proximal and more distal upstream 
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
nucleotide sequence which can stimulate promoter activity and may be an innate element of 
the promoter or a heterologous element inserted to enhance the level or tissue-specificity of 
a promoter. Promoters may be derived in their entirety from a native gene, or be composed 
of different elements derived from different promoters found in nature, or even comprise 
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synthetic nucleotide segments. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental conditions. 
Promoters which cause a nucleic acid fragment to be expressed in most cell types at most 
5 times are commonly referred to as "constitutive promoters". New promoters of various 
types useful in plant cells are constantly being discovered; numerous examples may be 
found in the compilation by Okamuro and Goldberg (1989) Biochemistry of Plants 75:1-82. 
It is further recognized that since in most cases the exact boundaries of regulatory sequences 
have not been completely defined, nucleic acid fragments of different lengths may have 

10 identical promoter activity. 

The "translation leader sequence" refers to a nucleotide sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
present in the fully processed mRNA upstream of the translation start sequence. The 
translation leader sequence may affect processing of the primary transcript to mRNA, 

15 mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner and Foster (1995) Mol Biotechnol 3:225-236). 

The "3' non-coding sequences" refer to nucleotide sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 
encoding regulatory signals capable of affecting mRNA processing or gene expression. The 

20 polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3' end of the mRNA precursor. The use of different 3* non-coding sequences is 
exemplified by Ingelbrecht et al. (1989) Plant Cell 7:671-680. 

"RN A transcript" refers to the product resulting from RNA polymerase-catalyzed 
transcription of a DNA sequence. When the RNA transcript is a perfect complementary 

25 copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived from posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
without introns and that can be translated into polypeptide by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 

30 refers to an RNA transcript that includes the mRNA and so can be translated into a 
polypeptide by the cell. "Antisense RNA" refers to an RNA transcript that is 
complementary to all or part of a target primary transcript or mRNA and that blocks the 
expression of a target gene (see U.S. Patent No. 5,107,065, incorporated herein by 
reference). The complementarity of an antisense RNA may be with any part of the specific 

35 nucleotide sequence, i.e., at the 5 f non-coding sequence, 3 1 non-coding sequence, introns, or 
the coding sequence. "Functional RNA" refers to sense RNA, antisense RNA, ribozyme 
RNA, or other RNA that may not be translated but yet has an effect on cellular processes. 
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The term "operably linked" refers to the association of two or more nucleic acid 
fragments on a single nucleic acid fragment so that the function of one is affected by the 
other. For example, a promoter is operably linked with a coding sequence when it is capable 
of affecting the expression of that coding sequence (i.e., that the coding sequence is under 
5 the transcriptional control of the promoter). Coding sequences can be operably linked to 
regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 

10 "Antisense inhibition" refers to the production of antisense RNA transcripts capable of 

suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production in normal or 
non-transformed organisms. "Co-suppression" refers to the production of sense RNA 
transcripts capable of suppressing the expression of identical or substantially similar foreign 

15 or endogenous genes (U.S. Patent No. 5,23 1 ,020, incorporated herein by reference). 

"Altered levels" refers to the production of gene product(s) in transgenic organisms in 
amounts or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 

20 "Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
conjunction with a protein and directs the protein to the chloroplast or other plastid types 

25 present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 
amino acid sequence which is translated in conjunction with a protein and directs the protein 
to the secretory system (Chrispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol 42:21-53). 
If the protein is to be directed to a vacuole, a vacuolar targeting signal {supra) can further be 

30 added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra) 
may be added. If the protein is to be directed to the nucleus, any signal peptide present 
should be removed and instead a nuclear localization signal included (Raikhel (1992) Plant 
Phys. 700:1627-1632). 

"Transformation" refers to the transfer of a nucleic acid fragment into the genome of a 

35 host organism, resulting in genetically stable inheritance. Host organisms containing the 

transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al. (1987) Meth. Enzymol 143:211) and particle-accelerated or "gene gun" transformation 
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technology (Klein et al. (1987) Nature (London) 327:70-73; U.S. Patent No. 4,945,050, 
incorporated herein by reference). 

Standard recombinant DNA and molecular cloning techniques used herein are well 
known in the art and are described more fully in Sambrook et al. Molecular Cloning: A 
5 Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 
(hereinafter "Maniatis")- 

Nucleic acid fragments encoding at least a portion of several UDP-galactose 
4-epimerases have been isolated and identified by comparison of random plant cDNA 
sequences to public databases containing nucleotide and protein sequences using the BLAST 

10 algorithms well known to those skilled in the art. The nucleic acid fragments of the instant 
invention may be used to isolate cDNAs and genes encoding homologous proteins from the 
same or other plant species. Isolation of homologous genes using sequence-dependent 
protocols is well known in the art. Examples of sequence-dependent protocols include, but 
are not limited to, methods of nucleic acid hybridization, and methods of DNA and RNA 

15 amplification as exemplified by various uses of nucleic acid amplification technologies (e.g., 
polymerase chain reaction, ligase chain reaction). 

For example, genes encoding other UDP-galactose 4-epimerases, either as cDNAs or 
genomic DNAs, could be isolated directly by using all or a portion of the instant nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant employing 

20 methodology well known to those skilled in the art. Specific oligonucleotide probes based 
upon the instant nucleic acid sequences can be designed and synthesized by methods known 
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize 
DNA probes by methods known to the skilled artisan such as random primer DNA labeling, 
nick translation, or end-labeling techniques, or RNA probes using available in vitro 

25 transcription systems. In addition, specific primers can be designed and used to amplify a 
part or all of the instant sequences. The resulting amplification products can be labeled 
directly during amplification reactions or labeled after amplification reactions, and used as 
probes to isolate full length cDNA or genomic fragments under conditions of appropriate 
stringency. 

30 In addition, two short segments of the instant nucleic acid fragments may be used in 

polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding 
homologous genes from DNA or RNA. The polymerase chain reaction may also be 
performed on a library of cloned nucleic acid fragments wherein the sequence of one primer 
is derived from the instant nucleic acid fragments, and the sequence of the other primer takes 

35 advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA 

precursor encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning vector. For example, the skilled artisan can follow 
the RACE protocol (Frohman et al. (1988) Proc. Natl Acad. Set USA 55:8998-9002) to 

12 
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generate cDNAs by using PCR to amplify copies of the region between a single point in the 
transcript and the 3' or 5' end. Primers oriented in the 3* and 5' directions can be designed 
from the instant sequences. Using commercially available 3' RACE or 5' RACE systems 
(BRL), specific 3' or 5* cDNA fragments can be isolated (Ohara et al. (1989) Proc. Natl 
5 Acad Set USA 55:5673-5677; Loh et al. (1989) Science 243:217-220), Products generated 
by the 3* and 5* RACE procedures can be combined to generate full-length cDNAs (Frohman 
and Martin (1989) Techniques 7:165). Consequently, a polynucleotide comprising a 
nucleotide sequence of at least one of 60 (preferably one of at least 40, most preferably one 
of at least 30) contiguous nucleotides derived from a nucleotide sequence selected from the 
10 group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, and 23 and the 

complement of such nucleotide sequences may be used in such methods to obtain a nucleic 
acid fragment encoding a substantial portion of an amino acid sequence of a polypeptide. 
The present invention relates to a method of obtaining a nucleic acid fragment encoding a 
substantial portion of a UDP-galactose 4-epimerase polypeptide preferably a substantial 
15 portion of a plant UDP-galactose 4-epimerase polypeptide, comprising the steps of: 

synthesizing an oligonucleotide primer comprising a nucleotide sequence of at least one of 
60 (preferably at least one of 40, most preferably at least one of 30) contiguous nucleotides 
derived from a nucleotide sequence selected from the group consisting of SEQ ID NOs:l, 3, 
5, 7, 9, 11, 13, 15, 17, 19, 21, and 23, and the complement of such nucleotide sequences; and 
20 amplifying a nucleic acid fragment (preferably a cDNA inserted in a cloning vector) using 
the oligonucleotide primer. The amplified nucleic acid fragment preferably will encode a 
portion of a UDP-galactose 4-epimerase polypeptide. 

Availability of the instant nucleotide and deduced amino acid sequences facilitates 
immunological screening of cDNA expression libraries. Synthetic peptides representing 
25 portions of the instant amino acid sequences may be synthesized. These peptides can be 
used to immunize animals to produce polyclonal or monoclonal antibodies with specificity 
for peptides or proteins comprising the amino acid sequences. These antibodies can be then 
be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest 
(Lemer(1984)^6?v. Immunol 36:1-34; Maniatis). 
30 The nucleic acid fragments of the instant invention may be used to create transgenic 

plants in which the disclosed polypeptides are present at higher or lower levels than normal 
or in cell types or developmental stages in which they are not normally found. This would 
have the effect of altering the level of cell wall and starch biosynthesis in those cells. 
Modulation of the expression of UDP-galactose 4-epimerase can be used to control 
35 carbohydrate partitioning between cell wall and starch biosynthesis. Changes in the 

expression of UDP-glucose 4-epimerase will allow the modification of the carbohydrate 
metabolism in transgenic plants. Modification of the expression of UDP-glucose 
4-epimerase may result in grains with reduced cell -wall constituents (fiber) and increased 

13 
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levels of starch. This trait will add value for feed, food, and industrial applications of the 
crops. For example, overexpression of UDP-glucose 4-epimerase in soybean should yield 
crops with lower contents of raffmose and stachyose and with significantly higher contents 
of sucrose. 

5 Overexpression of the proteins of the instant invention may be accomplished by first 

constructing a chimeric gene in which the coding region is operably linked to a promoter 
capable of directing expression of a gene in the desired tissues at the desired stage of 
development. The chimeric gene may comprise promoter sequences and translation leader 
sequences derived from the same genes. 3* Non-coding sequences encoding transcription 
10 termination signals may also be provided. The instant chimeric gene may also comprise one 
or more introns in order to facilitate gene expression. 

Plasmid vectors comprising the isolated polynucleotide (or chimeric gene) may be 
constructed. The choice of plasmid vector is dependent upon the method that will be used to 
transform host plants. The skilled artisan is well aware of the genetic elements that must be 
15 present on the plasmid vector in order to successfully transform, select and propagate host 
cells containing the chimeric gene. The skilled artisan will also recognize that different 
independent transformation events will result in different levels and patterns of expression 
(Jones et al. (1985) EMBO J. 4:241 1-241 8; De Almeida et al. (1 989) MoL Gen. Genetics 
275:78-86), and thus that multiple events must be screened in order to obtain lines 
20 displaying the desired expression level and pattern. Such screening may be accomplished by 
Southern analysis of DNA, Northern analysis of mRNA expression, Western analysis of 
protein expression, or phenotypic analysis. 

For some applications it may be useful to direct the instant polypeptides to different 
cellular compartments, or to facilitate its secretion from the cell. It is thus envisioned that 
25 the chimeric gene described above may be further supplemented by directing the coding 
sequence to encode the instant polypeptides with appropriate intracellular targeting 
sequences such as transit sequences (Keegstra (1989) Cell 56:247-253), signal sequences or 
sequences encoding endoplasmic reticulum localization (Chrispeels (1991) Ann. Rev. Plant 
Phys. Plant MoL Biol. 42:21-53), or nuclear localization signals (Raikhel (1992) Plant 
30 Phys. 1 00: 1 627- 1 63 2) with or without removing targeting sequences that are already present. 
While the references cited give examples of each of these, the list is not exhaustive and more 
targeting signals of use may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of genes encoding the 
instant polypeptides in plants for some applications. In order to accomplish this, a chimeric 
35 gene designed for co-suppression of the instant polypeptide can be constructed by linking a 
gene or gene fragment encoding that polypeptide to plant promoter sequences. 
Alternatively, a chimeric gene designed to express antisense RNA for all or part of the 
instant nucleic acid fragment can be constructed by linking the gene or gene fragment in 
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reverse orientation to plant promoter sequences. Either the co-suppression or antisense 
chimeric genes could be introduced into plants via transformation wherein expression of the 
corresponding endogenous genes are reduced or eliminated. 

Molecular genetic solutions to the generation of plants with altered gene expression 
5 have a decided advantage over more traditional plant breeding approaches. Changes in plant 
phenotypes can be produced by specifically inhibiting expression of one or more genes by 
antisense inhibition or cosuppression (U.S. Patent Nos. 5,190,931, 5,107,065 and 
5,283,323). An antisense or cosuppression construct would act as a dominant negative 
regulator of gene activity. While conventional mutations can yield negative regulation of 
10 gene activity these effects are most likely recessive. The dominant negative regulation 

available with a transgenic approach may be advantageous from a breeding perspective. In 
addition, the ability to restrict the expression of specific phenotype to the reproductive 
tissues of the plant by the use of tissue specific promoters may confer agronomic advantages 
relative to conventional mutations which may have an effect in all tissues in which a mutant 
15 gene is ordinarily expressed. 

The person skilled in the art will know that special considerations are associated with 
the use of antisense or cosuppression technologies in order to reduce expression of particular 
genes. For example, the proper level of expression of sense or antisense genes may require 
the use of different chimeric genes utilizing different regulatory elements known to the 
20 skilled artisan. Once transgenic plants are obtained by one of the methods described above, 
it will be necessary to screen individual transgenics for those that most effectively display 
the desired phenotype. Accordingly, the skilled artisan will develop methods for screening 
large numbers of transformants. The nature of these screens will generally be chosen on 
practical grounds. For example, one can screen by looking for changes in gene expression 
25 by using antibodies specific for the protein encoded by the gene being suppressed, or one 

could establish assays that specifically measure enzyme activity. A preferred method will be 
one which allows large numbers of samples to be processed rapidly, since it will be expected 
that a large number of transformants will be negative for the desired phenotype. 

The instant polypeptides (or portions thereof) may be produced in heterologous host 
30 cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies to the 
these proteins by methods well known to those skilled in the art. The antibodies are useful 
for detecting the polypeptides of the instant invention in situ in cells or in vitro in cell 
extracts. Preferred heterologous host cells for production of the instant polypeptides are 
microbial hosts. Microbial expression systems and expression vectors containing regulatory 
35 sequences that direct high level expression of foreign proteins are well known to those 

skilled in the art. Any of these could be used to construct a chimeric gene for production of 
the instant polypeptides. This chimeric gene could then be introduced into appropriate 
microorganisms via transformation to provide high level expression of the encoded 
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UDP-galactose 4-epimerase. An example of a vector for high level expression of the instant 
polypeptides in a bacterial host is provided (Example 6). 

All or a substantial portion of the nucleic acid fragments of the instant invention may 
also be used as probes for genetically and physically mapping the genes that they are a part 

5 of, and as markers for traits linked to those genes. Such information may be useful in plant 
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic 
acid fragments may be used as restriction fragment length polymorphism (RFLP) markers. 
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with 
the nucleic acid fragments of the instant invention. The resulting banding patterns may then 

10 be subjected to genetic analyses using computer programs such as MapMaker (Lander et aL 
(1987) Genomics 7:174-181) in order to construct a genetic map. In addition, the nucleic 
acid fragments of the instant invention may be used to probe Southern blots containing 
restriction endonuclease-treated genomic DNAs of a set of individuals representing parent 
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted 

15 and used to calculate the position of the instant nucleic acid sequence in the genetic map 
previously obtained using this population (Botstein et al. (1980) Am. J. Hum, Genet 
52:314-331). 

The production and use of plant gene-derived probes for use in genetic mapping is 
described in Bernatzky and Tanksley (1986) Plant Mol. Biol Reporter 4:31 r -41. Numerous 
20 publications describe genetic mapping of specific cDNA clones using the methodology 
outlined above or variations thereof. For example, F2 intercross populations, backcross 
populations, randomly mated populations, near isogenic lines, and other sets of individuals 
may be used for mapping. Such methodologies are well known to those skilled in the art. 
Nucleic acid probes derived from the instant nucleic acid sequences may also be used 
25 for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel et al. In: 
Nonmammalian Genomic Analysis: A Practical Guide, Academic press 1996, pp. 319-346, 
and references cited therein). 

In another embodiment, nucleic acid probes derived from the instant nucleic acid 
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask 
30 (1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor use of 
large clones (several to several hundred KB; see Laan et al. (1995) Genome Res. 5:13-20), 
improvements in sensitivity may allow performance of FISH mapping using shorter probes. 

A variety of nucleic acid amplification-based methods of genetic and physical 
mapping may be carried out using the instant nucleic acid sequences. Examples include 
35 allele-specific amplification (Kazazian (1989) J. Lab. Clin. Med. 77:95-96), polymorphism 
of PCR-amplified fragments (CAPS; Sheffield et aL (1993) Genomics 76:325-332), allele- 
specific ligation (Landegren et al. (1988) Science 247:1077-1080), nucleotide extension 
reactions (Sokolov (1990) Nucleic Acid Res. 75:3671), Radiation Hybrid Mapping (Walter 
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et al. (1997) Nat. Genet. 7:22-28) and Happy Mapping (Dear and Cook (1989) Nucleic Acid 
Res. 77:6795-6807). For these methods, the sequence of a nucleic acid fragment is used to 
design and produce primer pairs for use in the amplification reaction or in primer extension 
reactions. The design of such primers is well known to those skilled in the art. In methods 
5 employing PCR-based genetic mapping, it may be necessary to identify DNA sequence 
differences between the parents of the mapping cross in the region corresponding to the 
instant nucleic acid sequence. This, however, is generally not necessary for mapping 
methods. 

Loss of function mutant phenotypes may be identified for the instant cDNA clones 

10 either by targeted gene disruption protocols or by identifying specific mutants for these 
genes contained in a maize population carrying mutations in all possible genes (Ballinger 
and Benzer (1989) Proc. Natl Acad Sci USA 56:9402-9406; Koes et al. (1995) Proc, Natl 
Acad. Sci USA 92:8149-8153; Bensen et al. (1995) Plant Cell 7:75-84). The latter approach 
may be accomplished in two ways. First, short segments of the instant nucleic acid 

15 fragments may be used in polymerase chain reaction protocols in conjunction with a 
mutation tag sequence primer on DNAs prepared from a population of plants in which 
Mutator transposons or some other mutation-causing DNA element has been introduced 
(see Bensen, supra). The amplification of a specific DNA fragment with these primers 
indicates the insertion of the mutation tag element in or near the plant gene encoding the 

20 instant polypeptides. Alternatively, the instant nucleic acid fragment may be used as a 
hybridization probe against PCR amplification products generated from the mutation 
population using the mutation tag sequence primer in conjunction with an arbitrary genomic 
site primer, such as that for a restriction enzyme site-anchored synthetic adaptor. With 
either method, a plant containing a mutation in the endogenous gene encoding the instant 

25 polypeptides can be identified and obtained. This mutant plant can then be used to 

determine or confirm the natural function of the instant polypeptides disclosed herein. 

EXAMPLES 

The present invention is further defined in the following Examples, in which all parts 
and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 
30 understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
skilled in the art can ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usages and conditions. 
35 EXAMPLE 1 

Composition of cDNA Libraries; Isolation and Sequencing of cDNA Clones 
cDNA libraries representing mRNAs from various corn, rice, soybean, and wheat 
tissues were prepared. The characteristics of the libraries are described below. 
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TABLE 2 

cDNA Libraries from Corn, Rice, Soybean, and Wheat 



Library 


Tissue 


Clone 


cen3n 


Corn Endosperm 20 Days After Pollination* 


cen3n.pk0155.b8 


p0083 


Corn Whole Kernels 7 Days After Pollination 


P 0083.clddm72r 


rlr2 


Rice Leaf 15 Days After Germination, 2 Hours After 
Infection of Strain Magaporthe grisea 4360-R-62 
(AVR2-YAMO); Resistant 


rlr2.pk0043.c3 


rls24 


Rice Leaf 15 Days After Germination, 24 Hours After 
Infection of Strain Magaporthe grisea 4360-R-67 
(AVR2-YAMO); Susceptible 


rls24.pk0008.dl2 


se6 


Soybean Embryo, 26 Days After Flowering 


se6.pk0014.fl2 


sls2c 


Soybean Infected With Sclerotinia sclerotiorum Mycelium 


sls2c.pk017.k22 


wdkSc 


Wheat Developing Kernel, 30 Days After Anthesis 


wdk5c.pk006.o4 


wlmO 


Wheat Seedlings 0 Hour After Inoculation With Erysiphe 
graminis f. sp tritici 


wlmO.pk0015.g3 



5 cDNA libraries may be prepared by any one of many methods available. For 

example, the cDNAs may be introduced into plasmid vectors by first preparing the cDNA 
libraries in Uni-ZAP™ XR vectors according to the manufacturer's protocol (Stratagene 
Cloning Systems, La Jolla, CA). The Uni-ZAP™ XR libraries are converted into plasmid 
libraries according to the protocol provided by Stratagene. Upon conversion, cDNA inserts 
10 will be contained in the plasmid vector pBluescript In addition, the cDNAs may be 

introduced directly into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA 
ligase (New England Biolabs), followed by transfection into DH10B cells according to the 
manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts are in plasmid 
vectors, plasmid DNAs are prepared from randomly picked bacterial colonies containing 
15 recombinant pBluescript plasmids, or the insert cDNA sequences are amplified via 

polymerase chain reaction using primers specific for vector sequences flanking the inserted 
cDNA sequences. Amplified insert DNAs or plasmid DNAs are sequenced in dye-primer 
sequencing reactions to generate partial cDNA sequences (expressed sequence tags or 
"ESTs"; see Adams et al., (1991) Science 252:1651-1656). The resulting ESTs are analyzed 
20 using a Perkin Elmer Model 377 fluorescent sequencer. 

EXAMPLE 2 
Identification of cDNA Clones 
cDNA clones encoding UDP-galactose 4-epimerases were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol 
25 275:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS 
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translations, sequences derived from the 3 -dimensional structure Brookhaven Protein Data 
Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and 
DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed for similarity 
to all publicly available DNA sequences contained in the "nr" database using the BLASTN 

5 algorithm provided by the National Center for Biotechnology Information (NCBI). The 
DNA sequences were translated in all reading frames and compared for similarity to all 
publicly available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish and States (1993) Nat. Genet 5:266-272) provided by the NCBI. For 
convenience, the P-value (probability) of observing a match of a cDNA sequence to a 

10 sequence contained in the searched databases merely by chance as calculated by BLAST are 
reported herein as "pLog" values, which represent the negative of the logarithm of the 
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the 
cDNA sequence and the BLAST "hit" represent homologous proteins. 

EXAMPLE 3 

15 Characterization of cDNA Clones Encoding UDP-Galactose 4-Epimerase 

The BLASTX search using the EST sequences from clones listed in Table 3 revealed 
similarity of the polypeptides encoded by the cDNAs to UDP-galactose 4-epimerase from 



Pisum sativum and Cyamopsis tetragonoloba (NCBI General Identifier No. 1 173555 and 


3021357, respectively). Shown in Table 3 are the BLAST results for individual ESTs 


("EST"): 










TABLE 3 




BLAST Results for Sequences Encoding Polypeptides 




Homologous 


to UDP-Galactose 4-Epimerase 






BLAST pLog Score 


Clone 


Status 


1173555 3021357 


cen3n.pk0155.b8 


EST 


76.00 90.40 


rlr2.pk0043.c3 


EST 


24.10 35.52 


sls2c.pk017.k22 


EST 


66.52 40.40 


wdk5c.pk006.o4 


EST 


68.70 40.40 



The sequence of the entire cDNA insert in the clones mentioned above was 
determined. Further analyses of the data indicated that there are two forms of 
UDP-galactose 4-epimerase, a cytoplasmic form similar to the Pisum sativum sequence, and 
a plastid form similar to the Cyamopsis tetragonoloba sequence. ESTs encoding both kinds 
30 of UDP-galactose 4-epimerases were found in the DuPont proprietary database. The 

BLAST search using the sequences from clones listed in Table 4 revealed similarity of the 
polypeptides encoded by the cDNAs to UDP-galactose 4-epimerase (cytoplasmic) from 
Pisum sativum (NCBI General Identifier No. 1 173555). Shown in Table 4 are the BLAST 
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results for individual ESTs ("EST"), 


or for the sequences of the entire cDN A inserts 


comprising the indicated cDNA clones and encoding the entire protein ("CGS"): 






TABLE 4 


BLAST Results for Sequences Encoding Polypeptides 


Homologous to Cytoplasmic UDP-Galactose 4-Epimerase 






BLAST pLog Score 


Clone 


Status 


1173555 


p0083.clddm72r 


EST 


84.30 


rls24.pk0008.dl2 


EST 


26.10 


sls2c.pk017.k22:fis 


CGS 


>254.00 


wdk5c.pk006.o4:fis 


CGS 


154.00 



The BLAST search using the sequences from clones listed in Table 5 revealed 
similarity of the polypeptides encoded by the cDNAs to UDP-galactose 4-epimerase 
10 (plastid) from Cyamopsis tetragonoloba (NCBI General Identifier No. 3021357). Shown in 
Table 5 are the BLAST results for individual ESTs ("EST"), the sequences of the entire 
cDNA inserts comprising the indicated cDNA clones ("FIS"), or FIS sequences encoding 
the entire protein ("CGS"): 

15 TABLE 5 



BLAST Results for Sequences Encoding Polypeptides 
Homologous to Plastid UDP-Galactose 4-Epimerase 



Clone 


Status 


BLAST pLog Score 
3021357 


cen3n.pk0155.b8:fis 


FIS 


138.00 


rlr2.pk0043.c3:fis 


CGS 


165.00 


se6.pk0014.fl2 


CGS 


>254.00 


wlmO.pk0015.g3 


EST 


21.00 



Figure 1 presents an alignment of the amino acid sequences set forth in SEQ ID 
20 NOs:14, 16, 18, and 22 and the Pisum sativum and Cyamopsis tetragonoloba sequences 
(SEQ ID NO:25 and SEQ ID NO:26). The amino acid sequence from clone 
cen3n.pk0155.b8:fis contains 353 amino acids and the amino acid sequence from Cyamopsis 
tetragonoloba contains 350 amino acids, but the alignment between both sequences starts at 
amino acid 65 of the Cyamopsis tetragonoloba sequence. The data in Table 6 represents a 
25 calculation of the percent identity of the amino acid sequences set forth in SEQ ID NOs:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, and 24 and the Pisum sativum and Cyamopsis tetragonoloba 
sequences (SEQ ID NO:25 and SEQ ID NO:26). 
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TABLE 6 

Percent Identity of Amino Acid Sequences Deduced From the Nucleotide Sequences 
Sequences of cDNA Clones Encoding Polypeptides 



bLvJ 1U NO. 


Percent Identity to 

1 1 7^SSS 
l i / j jj j 


3021357 


*> 


66 3 

\J\J . 


77.5 


A 

4 




67.7 


O 


7Q 1 


53.4 


8 


66.0 


54.9 


10 


56.3 


53.3 


12 


47.5 


44.4 


14 


90.0 


64.9 


16 


71.1 


62.6 


18 


56.3 


64.3 


20 


64.3 


78.9 


22 


63.3 


87.1 


24 


29.1 


45.6 



Sequence alignments and percent identity calculations were performed using the 
Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., 
Madison, WI). Multiple alignment of the sequences was performed using the Clustal 
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default 
10 parameters (GAP PENALTY-10, GAP LENGTH PEN ALT Y= 10). Default parameters for 
pairwise alignments using the Clustal method were KTUPLE 1, GAP PENALTY=3, 
WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments and BLAST scores and 
probabilities indicate that the nucleic acid fragments comprising the instant cDN A clones 
encode a substantial portion or entire corn, rice, soybean, and wheat cytoplasmic 
15 UDP-galactose 4-epimerase and a substantial portion or entire corn, rice, soybean, and 

wheat plastidic UDP-galactose 4-epimerase. These sequences represent the first corn, rice, 
soybean, and wheat sequences encoding UDP-galactose 4-epimerase. 

EXAMPLE 4 
Expression of Chimeric Genes in Monocot Cells 
20 A chimeric gene comprising a cDNA encoding the instant polypeptides in sense 

orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA 
fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be 
constructed. The cDNA fragment of this gene may be generated by polymerase chain 
reaction (PGR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites 
25 (Ncol or Smal) can be incorporated into the oligonucleotides to provide proper orientation 
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of the DNA fragment when inserted into the digested vector pML103 as described beiow. 
Amplification is then performed in a standard PCR. The amplified DNA is then digested 
with restriction enzymes Ncol and Smal and fractionated on an agarose gel. The appropriate 
band can be isolated from the gel and combined with a 4.9 kb Ncol-Smal fragment of the 
5 plasmid pML 1 03 . Plasmid pML 1 03 has been deposited under the terms of the Budapest 
Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas, 
VA 201 10-2209), and bears accession number ATCC 97366. The DNA segment from 
pML103 contains a 1.05 kb Sall-Ncol promoter fragment of the maize 27 kD zein gene and 
a 0.96 kb Smal-Sall fragment from the 3' end of the maize 10 kD zein gene in the vector 
10 pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 15°C overnight, 

essentially as described (Maniatis). The ligated DNA may then be used to transform E. coli 
XLl-Blue (Epicurian Coli XL-1 Blue™; Stratagene). Bacterial transformants can be 
screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence 
analysis using the dideoxy chain termination method (Sequenase™ DNA Sequencing Kit; 
15 U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene 
encoding, in the 5' to 3* direction, the maize 27 kD zein promoter, a cDNA fragment 
encoding the instant polypeptides, and the 10 kD zein 3' region. 

The chimeric gene described above can then be introduced into com cells by the 
following procedure. Immature corn embryos can be dissected from developing caryopses 
20 derived from crosses of the inbred corn lines H99 and LH132. The embryos are isolated 10 
to 1 1 days after pollination when they are 1.0 to 1.5 mm long. The embryos are then placed 
with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al. 
(1975) ScL Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses of cells with somatic 
25 proembryoids and embryoids borne on suspensor structures proliferates from the scutellum 
of these immature embryos. The embryogenic callus isolated from the primary explant can 
be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks. 

The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt, 
Germany) may be used in transformation experiments in order to provide for a selectable 
30 marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236) 
which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers 
resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat 
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus 
(Odell et al. (1985) Nature 313:810-812) and the 3* region of the nopaline synthase gene 
35 from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) may be used 
to transfer genes to the callus culture cells. According to this method, gold particles (1 \im 
in diameter) are coated with DNA using the following technique. Ten jig of plasmid DNAs 
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are added to 50 \xL of a suspension of gold particles (60 mg per mL). Calcium chloride 
(50 [iL of a 2.5 M solution) and spermidine free base (20 ^iL of a 1.0 M solution) are added 
to the particles. The suspension is vortexed during the addition of these solutions. After 
1 0 minutes, the tubes are briefly centrifuged (5 sec at 1 5,000 rpm) and the supernatant 
5 removed. The particles are resuspended in 200 |llL of absolute ethanol, centrifuged again 
and the supernatant removed. The ethanol rinse is performed again and the particles 
resuspended in a final volume of 30 p,L of ethanol. An aliquot (5 |xL) of the DNA-coated 
gold particles can be placed in the center of a Kapton™ flying disc (Bio-Rad Labs). The 
particles are then accelerated into the com tissue with a Biolistic™ PDS-1000/He (Bio-Rad 

10 Instruments, Hercules CA), using a helium pressure of 1 000 psi, a gap distance of 0.5 cm 
and a flying distance of 1.0 cm. 

For bombardment, the embryogenic tissue is placed on filter paper over agarose- 
solidified N6 medium. The tissue is arranged as a thin lawn and covered a circular area of 
about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of 

15 the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber is 
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a 
helium shock wave using a rupture membrane that bursts when the He pressure in the shock 
tube reaches 1000 psi. 

Seven days after bombardment the tissue can be transferred to N6 medium that 

20 contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to 
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to 
fresh N6 medium containing gluphosinate. After 6 weeks, areas of about 1 cm in diameter 
of actively growing callus can be identified on some of the plates containing the glufosinate- 
supplemented medium. These calli may continue to grow when sub-cultured on the 

25 selective medium. 

Plants can be regenerated from the transgenic callus by first transferring clusters of 
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the 
tissue can be transferred to regeneration medium (Fromm et al. (1990) Bio/Technology 
5:833-839). 

30 EXAMPLE 5 

Ex pression of Chimeric Genes in Dicot Cells 
A seed-specific expression cassette composed of the promoter and transcription 
terminator from the gene encoding the p subunit of the seed storage protein phaseolin from 
the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol Chem. 261 :9228-9238) can be used 
35 for expression of the instant polypeptides in transformed soybean. The phaseolin cassette 
includes about 500 nucleotides upstream (5') from the translation initiation codon and about 
1650 nucleotides downstream (3') from the translation stop codon of phaseolin. Between the 
5* and 3* regions are the unique restriction endonuclease sites Nco I (which includes the ATG 
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translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by 
Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain reaction 
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be 
5 incorporated into the oligonucleotides to provide proper orientation of the DNA fragment 
when inserted into the expression vector. Amplification is then performed as described 
above, and the isolated fragment is inserted into a pUC 1 8 vector carrying the seed 
expression cassette. 

Soybean embryos may then be transformed with the expression vector comprising 
10 sequences encoding the instant polypeptides. To induce somatic embryos, cotyledons, 

3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar 
A2872, can be cultured in the light or dark at 26°C on an appropriate agar medium for 
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and 
placed into a suitable liquid medium. After repeated selection for clusters of somatic 
15 embryos which multiplied as early, globular staged embryos, the suspensions are maintained 
as described below. 

Soybean embryogenic suspension cultures can maintained in 35 mL liquid media on a 
rotary shaker, 150 rpm, at 26°C with florescent lights on a 16:8 hour day/night schedule. 
Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 

20 35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the method of 
particle gun bombardment (Klein et al. (1987) Nature (London) 527:70-73, U.S. Patent 
No. 4,945,050). A DuPont Biolistic™ PDS1000/HE instrument (helium retrofit) can be used 
for these transformations. 

25 A selectable marker gene which can be used to facilitate soybean transformation is a 

chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. 
(1985) Nature 575:810-812), the hygromycin phosphotransferase gene from plasmid pJR225 
(from E. coli; Gritz et al.(1983) Gene 25:179-188) and the 3' region of the nopaline synthase 
gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression 

30 cassette comprising the phaseolin 5' region, the fragment encoding the instant polypeptides 
and the phaseolin 3 1 region can be isolated as a restriction fragment. This fragment can then 
be inserted into a unique restriction site of the vector carrying the marker gene. 

To 50 pL of a 60 mg/mL 1 pm gold particle suspension is added (in order): 5 pL 
DNA (1 pg/pL), 20 jj,1 spermidine (0.1 M), and 50 pL CaCl 2 (2.5 M). The particle 

35 preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the 
supernatant removed. The DNA-coated particles are then washed once in 400 pL 70% 
ethanol and resuspended in 40 pL of anhydrous ethanol. The DNA/particle suspension can 
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be sonicated three times for one second each. Five uL of the DNA-coated gold particles are 
then loaded on each macro carrier disk. 

Approximately 300-400 mg of a two-week-old suspension culture is placed in an 
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette. 
5 For each transformation experiment, approximately 5-10 plates of tissue are normally 

bombarded. Membrane rupture pressure is set at 1 100 psi and the chamber is evacuated to a 
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the 
retaining screen and bombarded three times. Following bombardment, the tissue can be 
divided in half and placed back into liquid and cultured as described above. 
10 Five to seven days post bombardment, the liquid media may be exchanged with fresh 

media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL 
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post 
bombardment, green, transformed tissue may be observed growing from untransformed, 
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into 
15 individual flasks to generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line may be treated as an independent transformation event. These 
suspensions can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic embryos. 

EXAMPLE 6 

20 Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant polypeptides can be inserted into the T7 E. coli 
expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987) 
Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter 
system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites in 
25 pET-3a at their original positions. An oligonucleotide adaptor containing EcoR I and 
Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with 
additional unique cloning sites for insertion of genes into the expression vector. Then, the 
Nde I site at the position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 
30 5'-CATATGG, was converted to 5'-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic 
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve 
GTG™ low melting agarose gel (FMC). Buffer and agarose contain 10 u.g/ml ethidium 
bromide for visualization of the DNA fragment. The fragment can then be purified from the 
35 agarose gel by digestion with GELase™ (Epicentre Technologies) according to the 

manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 |xL of water. 
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be 
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purified from the excess adapters using low melting agarose as described above. The vector 
pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized 
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can 
then be ligated at 16°C for 15 hours followed by transformation into DH5 electrocompetent 
5 cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 
100 jig/mL ampicillin. Transformants containing the gene encoding the instant polypeptides 
are then screened for the correct orientation with respect to the T7 promoter by restriction 
enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
10 orientation relative to the T7 promoter can be transformed into E. coli strain BL21(DE3) 
(Studier et al. (1986) J. Mol Biol 189:1 13-130). Cultures are grown in LB medium 
containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately 
1, IPTG (isopropylthio-P-galactoside, the inducer) can be added to a final concentration of 
0.4 mM and incubation can be continued for 3 h at 25°. Cells are then harvested by 
15 centrifugation and re-suspended in 50 |jL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM 
DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can 
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the supernatant 
determined. One \xg of protein from the soluble fraction of the culture can be separated by 
20 SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating 
at the expected molecular weight. 

EXAMPLE 7 

Functional Expression of the Soybean UDP-Galactose 4-Epimerase in E. coli 
Soybean varieties with inherently reduced raffmose saccharide content would improve 

25 the nutritional quality of derived soy protein products and reduce processing costs associated 
with the removal of raffmose saccharides. Said low raffmose saccharide soybean varieties 
would be more valuable than conventional varieties for animal and human diets and would 
allow mankind to more fully utilize the desirable nutritional qualities of this edible legume. 
Soybean clone sls2c.pk017.k22:fis (SEQ ID NO: 14), encoding an almost entire 

30 UDP-galactose 4-epimerase, was cloned into a pET24d vector and transformed into DH5oc 
competent cells to determine its activity in microbial cells. The fragment encoding the 
soybean UDP-galactose 4-epimerase was released from the BS-SK vector using restriction 
enzymes Eco RI and Sma I which are located in the multiple cloning site of the vector. To 
obtain a blunt end, the Eco RI restriction site was filled-in using T4 DN A polymerase (New 

35 England Biolabs). Nco I adapters (SEQ ID NO:27 and SEQ ID NO:28) containing a start 
methionine and three additional amino acids were ligatedovernight at 16°C to the blunt- 
ended UDP-galactose 4-epimerase fragment. 
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(SEQ ID NO:27) CATGGAGGAGCAG 

CTCCTCGTC ( SEQ ID NO:28) 

After heat-inactivation of the ligase, adapter ends were phosphorylated with T4 
5 polynucleotide kinase (New England Biolabs) for 30 minutes at 37°C. The 1255 bp 

UDP-galactose 4-epimerase fragment was gel purified using a 1% low melting agarose gel 
following manufacturers directions (FMC). The purified UDP-galactose 4-epimerase 
fragment containing phosphorylated Nco I adapter ends was ligated into an Nco I restricted 
pET24d vector (Novagen) overnight at 16°C. The ligation was transformed into DH5oc 
10 competent cells and plated onto 2xYT/50 pg/ml kanamycin plates. Plasmid DNA was 

purified and screened for insert and orientation by restriction with Eco RI. A clone in the 
sense orientation with respect to the T7 promoter and a clone with the insert in the antisense 
orientation with respect to the T7 promoter (negative control) were transformed into 
BL21(DE3) competent cells (Novagen). 
15 Single colonies were grown overnight at 37°C in 2x YT medium containing 50 ng/ml 

kanamycin. The cultures were diluted 30 fold in fresh medium and allowed to grow for an 
additional 2 hours to an optical density (at 600 nm) of 1 .0. Expression of the cDNA insert 
was induced by addition of isopropyl p-D-thiogalactopyranoside (IPTG) to the cultures to a 
final concentration of 1 mM. Cells were harvested by centrifugation after 3 hours and 
20 resuspended in 100 yxl of 100 mM potassium phosphate at pH 7.0 containing 3 mM 

dithiothreitol (DTT) and 4 mM phenylmethylsulfonylfluoride. A small amount of 1 mm 
glass beads were added and the mixture was sonicated three times for about 5 seconds each 
time with a microprobe sonicator. The mixture was centrifuged and the supernatant 
containing the protein transferred to a fresh tube. 
25 For assay of UDP 4-epimerase activity the following components were prepared in 

100 mM phosphate buffer, pH 7.0: 20 mM NADP, 200 mM sodium pyrophosphate, 1 mM 
glucose 1,6 diphosphate, 0.5 mM DTT1, 1 unit/fil phosphoglucomutase, 1 unit/pl glucose 
6-phosphate dehyrogenase, 0.05 units/jal UDP-glucose pyrophosphorylase, 100 mM 
UDP-galactose and 0.04 units/pi UDP-galactose 4-epimerase (SIGMA). Each 269 jxl assay 
30 contained 1 80 pi potassium phosphate buffer, 25 (il NADP, 5 pi sodium pyrophosphate, 
25 (al glucose 1,6 diphosphate, 1 pi DTT, 8 pi phosphoglucomutase, 1 pi glucose 
6-pyrophosphorylase, and 20 pi cell extract (or UDP-galactose 4-epimerase). The reaction 
was initiated with the addition of 2 pi 100 mM UDP-galactose and the production of 
NADPH was followed by monitoring the absorbance at 340 nm using a Shimadzu UV160U 
35 spectrophotometer. A nine-fold increase in epimerase activity was observed in the vessels 
containing the soybean UDP-galactose 4-epimerase fragment in the sense orientation with 
respect to the T7 promoter over those containing the soybean UDP-galactose 4-epimerase 
fragment in the antisense orientation with respect to the T7 promoter. As expected, an 
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approximately 37 kDa-expressed protein was observed in the soluble fraction of the induced 
DE3 cells containing the sense construct of pET24d-epimerase 4a. 

EXAMPLE 8 

Functional Expression of the Soybean UDP-Galactose 

5 4-Epimerase in Soybean Somatic Embryos 

The ability to change the levels of the raffinosaccharide pathway by overexpressing the 
gene from soybean clone sls2c.pk017.k22:fis in soybean somatic embryos was tested by 
preparing transgenic soybean somatic embryos and assaying the raffinose, stachyose, and 
sucrose levels. A cosuppressed phenotype should have low to nondetectable levels of 

10 raffinose and stachyose and increased levels of sucrose and can be expressed as a ratio of 
sucrose/(raffmose + stachyose). A ratio of less than 1 is considered a wild type phenotype, 
while a ratio of greater than 2.0 is considered a cosuppressed event. 

The entire insert from clone sls2c.pk017.k22:fis was amplified in a standard PCR 
reaction on a Perkin Elmer Applied Biosystems GeneAmp PCR System using Pfu 

15 polymerase (Stratagene). The resulting fragment is bound by an Nco I site at the 5* end and 
by a Pst I fragment at the 3' end. This fragment was digested, isolated, and ligated into the 
Nco I/Pst I sites of plasmid pKS18HH (described in US Patent No. 5,846,784) which had 
been modified by the insertion of the soybean glycinin subunit Gl promoter and terminator 
signals at the Sac I site. The sequence of the soybean Gy 1 glycinin subunit Gl was 

20 published by Sims and Goldberg (1989, Nucl. Acids Res. 17:4386). The promoter sequence 
consists of nucleotides 1 through 690 and the terminator sequence consists of nucleotides 
3126 through 3527. The new plasmid was named Gl-epimerase and contains the Gyl 
promoter, the epimerase sequence, and the Gyl termination signal surrounded by Sac I sites 
in plasmid pKS18HH. 

25 Transformation of Soybean Somatic Embryo Cultures 

The following stock solutions and media were used for transformation and propagation 
of soybean somatic embryos: 



Stock Solutions 




Media 


MS Sulfate lOOx stock 


te/L) 


SB55 (v>cr Liter) 


MgS0 4 .7H 2 0 


37.0 


10 mL of each MS stock 


MnS0 4 .H 2 0 


1.69 


1 mL of B5 Vitamin stock 


ZnS0 4 .7H 2 0 


0.86 


0.8 g NH 4 N0 3 


CuS0 4 .5H 2 0 


0.0025 


3.033 gKN0 3 






1 mL 2,4-D (10 mg/mL stock) 


MS Halides lOOx stock 




0.667 g asparagine 


CaCl 2 .2H 2 0 


44.0 


pH5.7 


KI 


0.083 
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Co0 2 .6H 2 0 

KH 2 P0 4 

H3BO3 

Na 2 Mo0 4 .2H 2 0 

Na 2 EDTA 

FeS0 4 .7H 2 0 

B5 Vitamin stock 

myoinositol 
nicotinic acid 
pyridoxine HC1 
thiamine 



0.00125 
17.0 
0.62 
0.025 
3.724 
2.784 



100.0 
1.0 
1.0 
10.0 



SB 103 (per Liter) 

1 pk. Murashige & Skoog salt mixture* 
60 g maltose 

2 g gelrite 
pH5.7 

SB 148 (per Liter) 
1 pk. Murashige & Skoog salt 
mixture* 
60 g maltose 
1 mL B5 vitamin stock 
7 g agarose 

pH 5.7 

*(Gibco BRL) 



Soybean embryonic suspension cultures were maintained in 35 mL liquid media 
(SB55) on a rotary shaker (150 rpm) at 28°C with a mix of fluorescent and incandescent 
5 lights providing a 16 h day 8 h night cycle. Cultures were subcultured every 2 to 3 weeks by 
inoculating approximately 35 mg of tissue into 35 mL of fresh liquid media. 

Soybean embryonic suspension cultures were transformed with Gl-epimerase by the 
method of particle gun bombardment (see Klein et al. (1987) Nature 327:70-73) using a 
DuPont Biolistic PDSIOOO/He instrument. Five jaL of Gl-epimerase plasmid DNA (1 g/L), 

10 50 ML CaCl 2 (2.5 M), and 20 ^L spermidine (0.1 M) were added to 50 |liL of a 60 mg/mL 

1 mm gold particle suspension. The particle preparation was agitated for 3 minutes, spun in 
a microfuge for 1 0 seconds and the supernate removed. The DNA-coated particles were 
then washed once with 400 jxL of 70% ethanol and resuspended in 40 of anhydrous 
ethanol. The DNA/particle suspension was sonicated three times for 1 second each. Five ]iL 

15 of the DNA-coated gold particles were then loaded on each macro carrier disk. 

Approximately 300 to 400 mg of two-week-old suspension culture was placed in an 
empty 60 mm X 15 mm petri dish and the residual liquid removed from the tissue using a 
pipette. The tissue was placed about 3.5 inches away from the retaining screen and 
bombarded twice. Membrane rupture pressure was set at 1 100 psi and the chamber was 

20 evacuated to -28 inches of Hg. Two plates were bombarded , and following bombardment, 
the tissue was divided in half, placed back into liquid media, and cultured as described 
above. 

Fifteen days after bombardment, the liquid media was exchanged with fresh SB55 
containing 50 mg/mL hygromycin. The selective media was refreshed weekly. Six weeks 
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after bombardment, green, transformed tissue was isolated and inoculated into flasks to 
generate new transformed embryonic suspension cultures. 

Transformed embryonic clusters were removed from liquid culture media and placed 
on a solid agar media, SB 103, containing 0.5% charcoal to begin maturation. After 1 week, 
5 embryos were transferred to SB 103 media minus charcoal. After 5 weeks on SB 103 media, 
maturing embryos were separated and placed onto SB 148 media. During maturation 
embryos were kept at 26°C with a mix of fluorescent and incandescent lights providing a 
16 h day 8 h night cycle. To mimic seed dry down, embryos were harvested after 5 weeks 
on SB 148 media. Each embryonic cluster gave rise to 5 to 20 somatic embryos. 

10 Non-transformed somatic embryos were cultured by the same method as used for the 

transformed somatic embryos. 
Analysis of Transformed Somatic Embryos 

At the end of the 5 th week on SB 148 medium somatic embryos were harvested from 14 
independently transformed lines. Soluble carbohydrates were extracted by crushing the 

15 embryos with a nylon pestle in a microfuge tube containing 200 p.L of 80% methanol. 

Extraction was repeated with an additional 200 \iL of 80% methanol and the supernatants 
combined and dried. The soluble carbohydrates were resuspended in 200 \iL water and 
analyzed using a Dionex DX500 chromatography system. Carbohydrates were separated on 
a Dionex CarboPac PAI (4x250 mm) column using 95% 0.2 M NaOH, 5% water at 

20 1 .0 ml/min. A total of 14 events (10 embryos each) were analyzed. The total area for the 
sugars raffinose, stachyose and sucrose were tabulated for each embryo. A cosuppressed 
phenotype should have low to nondetectable levels of raffinose and stachyose and increased 
levels of sucrose and can be expressed as a ratio of sucrose/(raffinose + stachyose). A ratio 
of less than 1 .0 is considered a wildtype phenotype, while a ratio of greater than 2.0 is 

25 considered a cosuppressed event. The averages and standard deviations for the areas of 

sucrose, raffinose, stachyose, and the ratio of sucrose/(raffinose + stachyose) for each of the 
14 samples are indicated in Table 7: 

TABLE 7 

30 Averages and Standard Deviations of the Carbohydrates From 

Somatic Soybean Embryos Expressing Chimeric 
Soybean UDP-Galactose 4-Epimerases 

Sucrose/ 



Somatic Embryo 


Sucrose 


Raffinose 


Stachyose 


(Raffinose+Stachyose 


4/4 


3568973.7 


1045112.8 


3967517 


1.02±0.8 




±1408264.7 


±641756.9 


±2900645.5 




4/5 


2856327.7 


904544 


3557979.3 


0.88±0.7 




±707852.7 


±521259.0 


±1715496.3 




4/7 


2877070.1 


717643.3 


3009836.7 


1±0.4 




±873920.3 


±609431.0 


±2407257.1 
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Sucrose/ 



Somatic Embryo 


Sucrose 


Raffinose 


Stachyose 


(Raffinose+Stachyose 


4/1 


2653179.9 


709370 


3876536.5 


0.77±0.5 




±1046953.1 


±379902.4 


±1999692.2 




4/2 


2857092.7 


626307.5 


3121925.9 


0.76±0.08 




±742415.0 


±115743.8 


±951294.5 




4/6 


3112203.2 


754341.9 


4601053 


0.61 ±0.1 5 




±850601.7 


±262408.2 


±1461924.7 




4/3 


3282564.1 


706353.5 


4602803.6 


0.58±0.17 




±1911513.1 


±428861.1 


±2261654.1 




3/3 


2691493.3 


536062.6 


2838255.8 


0.77±0.32 




±1538378.2 


±231855.5 


±1048200.9 




3/1 


2283160.5 


449773.1 


1983356 


1.44±1.25 




±1089482.4 


±229549.7 


±1099495.3 




3/4 


3375314.6 


616473.8 


3940545.5 


0.76±0.19 




±805313.2 


±185309.4 


±845544.6 




3/6 


81106208.1 


17813664.4 


101268706.9 


0.72±0.14 




±30013245.6 


±9546497.2 


±50277358.9 




3/2 


89847214.2 


17040544.3 


88496699.5 


1.05±0.70 




±14908804.2 


±5550687.9 


±34107697.8 




3/1 (repeat) 


73558780.2 


17948085.3 


73769338.2 


1.46±1.51 


±35218563.3 


±14008680.2 


±49942666.1 




3/5 


68427093.9 


13192646.4 


55486977 


1.24±0.75 




±20712691.0 


±9066329.2 


±36156784.6 





Of the 14 events analyzed, two were considered cosuppressed for UDP-glucose 
4'epimerase (4/1 and 3/1). Both of these events have at least 2 embryos that have a ratio 
5 greater than 2.0. Event 3/1 was repeated and both times exhibited cosuppression. 

Various modifications of the invention in addition to those shown and described herein 
will be apparent to those skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of the appended claims. 

The disclosure of each reference set forth above is incorporated herein by reference in 
10 its entirety. 
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CLAIMS 

What is claimed is: 

1 . An isolated polynucleotide comprising a first nucleotide sequence encoding a 
polypeptide of at least 90 amino acids that has at least 95% identity based on the Clustal 

5 method of alignment when compared to a polypeptide selected from the group consisting of 
SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24, 

or a second nucleotide sequence comprising the complement of the first 
nucleotide sequence. 

2. The isolated polynucleotide of Claim 1 , wherein the first nucleotide sequence 
10 consists of a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1,3, 

5, 7, 9, 11, 13, 15, 17, 19, 21, and 23 that codes for the polypeptide selected from the group 
consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24. 

3. The isolated polynucleotide of Claim 1 wherein the nucleotide sequences are 

DNA. 

15 4. The isolated polynucleotide of Claim 1 wherein the nucleotide sequences are 

RNA. 

5. A chimeric gene comprising the isolated polynucleotide of Claim 1 operably 
linked to suitable regulatory sequences. 

6. A host cell comprising the chimeric gene of Claim 5. 

20 7. A host cell comprising an isolated polynucleotide of Claim 1 . 

8. The host cell of Claim 7 wherein the host cell is selected from the group 
consisting of yeast, bacteria, plant, and virus. 

9. A virus comprising the isolated polynucleotide of Claim 1 . 

10. A polypeptide of at least 90 amino acids that has at least 95% identity based on 
25 the Clustal method of alignment when compared to a polypeptide selected from the group 

consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24. 

11. A method of selecting an isolated polynucleotide that affects the level of 
expression of a UDP-galactose 4-epimerase polypeptide in a plant cell, the method 
comprising the steps of: 

30 (a) constructing an isolated polynucleotide comprising a nucleotide sequence of 

at least one of 30 contiguous nucleotides derived from an isolated polynucleotide of 
Claim 1; 

(b) introducing the isolated polynucleotide into a plant cell; 

(c) measuring the level of a polypeptide in the plant cell containing the 
35 polynucleotide; and 

(d) comparing the level of polypeptide in the plant cell containing the isolated 
polynucleotide with the level of polypeptide in a plant cell that does not contain the isolated 
polynucleotide. 
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12. The method of Claim 1 1 wherein the isolated polynucleotide consists of a 
nucleotide sequence selected from the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 11, 13, 
15, 17, 19, 21, and 23 that codes for the polypeptide selected from the group consisting of 
SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24. 
5 13. A method of selecting an isolated polynucleotide that affects the level of 

expression of a UDP-galactose 4-epimerase polypeptide in a plant cell, the method 
comprising the steps of: 

(a) constructing an isolated polynucleotide of Claim 1 ; 

(b) introducing the isolated polynucleotide into a plant cell; 

10 (c) measuring the level of polypeptide in the plant cell containing the 

polynucleotide; and 

(d) comparing the level of polypeptide in the plant cell containing the isolated 
polynucleotide with the level of polypeptide in a plant cell that does not contain the 
polynucleotide. 

15 14. A method of obtaining a nucleic acid fragment encoding a UDP-galactose 

4-epimerase polypeptide comprising the steps of: 

(a) synthesizing an oligonucleotide primer comprising a nucleotide sequence of 
at least one of 30 contiguous nucleotides derived from a nucleotide sequence selected from 
the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, and the 

20 complement of such nucleotide sequences; and 

(b) amplifying a nucleic acid sequence using the oligonucleotide primer. 

15. A method of obtaining a nucleic acid fragment encoding a UDP-galactose 
4-epimerase polypeptide comprising the steps of: 

(a) probing a cDNA or genomic library with an isolated polynucleotide 

25 comprising at least one of 30 contiguous nucleotides derived from a nucleotide sequence 
selected from the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 
and the complement of such nucleotide sequences; 

(b) identifying a DNA clone that hybridizes with the isolated polynucleotide; 

(c) isolating the identified DNA clone; and 

30 (d) sequencing the cDNA or genomic fragment that comprises the isolated DNA 

clone. 

16. A composition comprising the isolated polynucleotide of Claim 1 . 

17. A composition comprising the isolated polynucleotide of Claim 10. 

18. An isolated polynucleotide comprising the nucleotide sequence having at least 
35 one of 30 contiguous nucleotides derived from a nucleic acid sequence selected from the 

group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, and the complement 
of such sequences. 
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19. An expression cassette comprising an isolated polynucleotide of Claim 1 
operably linked to a promoter. 

20. A method for positive selection of a transformed cell comprising: 

(a) transforming a host cell with the chimeric gene of Claim 5 or an expression 

5 cassette of Claim 20; and 

(b) growing the transformed host cell under conditions which allow expression 
of the polynucleotide in an amount sufficient to complement a null mutant to provide a 
positive selection means. 

2 1 . The method of Claim 20 wherein the plant cell is a monocot. 
10 22. The method of Claim 20 wherein the plant cell is a dicot. 
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SEQ ID 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 



NO:25 
NO:26 
NO:14 
NO:16 
NO: 18 
NO: 20 
NO: 22 



Aft/? 



20 



?Q0l 



MVAS S -QKI LVTGS AGFI GTHT WQLLNNGFN VS 1 1 DNFDNSVMEAVERVREWGSNLSQ 

MSS QT VLVTGGAGYIGSHT^QLLLGGFKAVVVDNLDNSSETAIHRVKEIiAGK-- FAG pA 77-, 

MVSSS^QHILVTGGAGFIGTHTWQLLKAGFSVSIIDNFDNSVMEAVDRVRQWGPLLSQ " ' C/V7" 
---AR^GSVXVTGGAGFIGTHTVLQLI^KGYAVTAVDNFHNSVPEAI.DRVRHIVGPAiSA ^N^p^^DS 

MVS ALLRTT L VTGGAGYI GSHTVLQLLQLGFRWVT, DNLDNASEIAILRVRELAGH-NAN 

MRD KT VLVTG G AG Y I GS HT VXQLLlLGG FRAWL DNLENS SE VAI HRVREIAGE- FGN 

1 60 



***** * *-*-•»* 



SEQ ID NO: 25 
SEQ ID NO:26 
SEQ ID NO: 14 
SEQ ID NO: 16 
SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 



NLE FTLGDLRNKDDLEKL FSKSKFDAVI H FAGLKAVGES VENPRRYFDNNLVGT INLYEV 
NLSFHKLDLRDRDALEKIFSSTKFDSVIHFAGLKAVGESVQKPLLYYDNNLIGTIVLFEV 
NLQFTQG DLRNRDDLEKLFSKTT FDAVIHFAGLKAVAE S VAKPRRYFDFNI.VGT INL YE F 
RLQFI FGDLTI KDDLEKVFAAKKYDAVI HFAGI.KAVAE SVAH PEMYNRNNI VGTVNLYDV 

RIDLRDKGALEMVFASTRFEAVIHFAGLKAVGESVQKPLLYYDNNVIGTINLLEV 

NLDFRKVDLRDKQALDQI FSSQRFEAV I H FAGLKAVGES VQKPLL Y YDNNLI GTITLLQV 
NLSFHKVDLRDRAAZ.DQIFSSTQFDAVIHFAGLKAVGESVQKPI.LYYNNNLTGTITLLEV 
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(57) Abstract: This invention relates to an isolated nucleic acid fragment encoding a UDP-galactose 4-epimerase. The invention 
also relates to the construction of a chimeric gene encoding all or a portion of the UDP-galactose 4-epimerase, in sense or antisense 
orientation, wherein expression of the chimeric gene results in production of altered levels of the UDP-galactose 4-epimerase in a 
transformed host cell. 
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SEQUENCE LISTING 

<110> E. I. du Pont de Nemours and Company 

<120> UDP-Glucose Modifiers 

<130> BB1321 

<140> 
<141> 

<150> 60/119,588 

<151> 199 9- February- 1 0 

<160> 28 

<170> Microsoft Office 97 

<210> 1 

<211> 509 

<212> DNA 

<213> Zea mays 

<220> 

<221> unsure 
<222> (413) 



<220> 

<221> unsure 
<222> (465) 



<220> 

<221> unsure 
<222> (485) 



<400> 1 

gattgatctc cgtgacaagg gagcactgga 
tgtcattcac ttcgctggat tgaaagctgt 
ttatgacaac aacgtcattg gcacgataaa 
caagaagttg gtgttctcat catcagctgc 
cacagaaaat tttcctctta ctccaaacaa 
agatatttgc cgggatatct accgttcaga 
cttcaatcca gttggtgctc atcctagtgg 
caacaatctt atgccctatg ttcagcaagt 
tttangaaat gactatgcaa caagagatg 



aatggttttt gcttctacaa gatttgaagc 60 
gggtgaaagc gtacagaagc cattacttta 120 
tcttctagaa gttatgtctg ttcacggttg 180 
agtttatgga tcacccaaaa actcaccctg 240 
tccatatggc aaaacaaagc tcgttgttga 300 
tcctgaatgg aagatcattt tacttaggta 360 
atatcttggc gaggacccac gangaattcc 420 
tgcggttggt aagangccag ctctaacagt 480 

509 



<210> 2 

<211> 169 

<212> PRT 

<213> Zea mays 



<220> 

<221> UNSURE 
<222> (138) 



<220> 

<221> UNSURE 
<222> (155) 



1 



WO 00/47755 



PCT/US00/O3453 



<220> 

<221> UNSURE 
<222> (162) 

<400> 2 

He Asp Leu Arg Asp Lys Gly Ala Leu Glu Met Val Phe Ala Ser Thr 
15 10 15 

Arg Phe Glu Ala Val He His Phe Ala Gly Leu Lys Ala Val Gly Glu 
20 25 30 

Ser Val Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Val He Gly Thr 
35 40 45 

He Asn Leu Leu Glu Val Met Ser Val His Gly Cys Lys Lys Leu Val 
50 55 60 

Phe Ser Ser Ser Ala Ala Val Tyr Gly Ser Pro Lys Asn Ser Pro Cys 
65 70 75 80 

Thr Glu Asn Phe Pro Leu Thr Pro Asn Asn Pro Tyr Gly Lys Thr Lys 
85 90 95 

Leu Val Val Glu Asp He Cys Arg Asp He Tyr Arg Ser Asp Pro Glu 
100 105 HO 

Trp Lys He lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His Pro 
115 120 125 

Ser Gly Tyr Leu Gly Glu Asp Pro Arg Xaa He Pro Asn Asn Leu Met 
130 135 140 

Pro Tyr Val Gin Gin Val Ala Val Gly Lys Xaa Pro Ala Leu Thr Val 
145 150 155 160 

Leu Xaa Asn Asp Tyr Ala Thr Arg Asp 
165 

<210> 3 

<211> 456 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> unsure 
<222> (322) 

<220> 

<221> unsure 
<222> (352) 

<220> 

<221> unsure 
<222> (391) 

<220> 

<221> unsure 
<222> (406) 



2 



WO 00/47755 



PCT/USOO/03453 



<220> 

<221> unsure 
<222> (447) 



<220> 

<221> unsure 
<222> (452) 



<400> 3 

atcactcttc ttcttccgct 
gcacacaaca acaacaagag 
cgatcctggt gacgggcggc 
aactcggctt ccgcgttgtc 
tccgcgtcag ggaactcgcc 
tccgcgacaa gcaagcgttg 
aatttgccgg gctgaaaact 
ctcatcggac ataaccacct 



ctctagcttt gctttgcttg 
taatcaaagt agaagaagat 
gccggctaca tcggcagcca 
gtcctcgaca acctcgacaa 
ggacacaacg ccaacaacct 
gnccaaatct tctcctctca 
gttggcgaaa ncgtgaaaaa 
gcagggnata gnggca 



cttcatcaaa ccccacacac 60 

ggtttcggcc ttgttgcgga 120 

caccgtcctc cagcttctcc 180 

cgcctccgag ctcgccatcc 240 

cgacttccgc aagggtgacc 300 

aaaggttgag gntgtcatcc 360 

cccctngttt tacgaaacaa 420 

456 



<210> 4 

<211> 99 

<212> PRT 

<213> Oryza sativa 



<220> 

<221> UNSURE 
<222> (69) 



<220> 

<221> UNSURE 
<222> (79) 



<220> 

<221> UNSURE 
<222> (92) 



<220> 

<221> UNSURE 
<222> (97) 



<400> 4 

Arg Thr lie Leu Val Thr Gly Gly 
1 5 

Val Leu Gin Leu Leu Gin Leu Gly 
20 

Leu Asp Asn Ala Ser Glu Leu Ala 
35 40 

Gly His Asn Ala Asn Asn Leu Asp 
50 55 

Lys Gin Ala Leu Xaa Gin lie Phe 
65 70 

He Gin Phe Ala Gly Leu Lys Thr 
85 



Ala Gly Tyr He Gly Ser His Thr 
10 15 

Phe Arg Val Val Val Leu Asp Asn 
25 30 

lie Leu Arg Val Arg Glu Leu Ala 
45 

Phe Arg Lys Gly Asp Leu Arg Asp 
60 

Ser Ser Gin Lys Val Glu Xaa Val 
75 80 

Val Gly Glu Xaa Val Lys Asn Pro 
90 95 



Xaa Phe Tyr 



WO 00/47755 



PCT/USOO/03453 



<210> 5 

<211> 479 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 

<222> (17) 

<220> 

<221> unsure 

<222> (19) 

<220> 

<221> unsure 

<222> (27) 

<220> 

<221> unsure 

<222> (34) 

<220> 

<221> unsure 

<222> (48) 

<220> 

<221> unsure 

<222> (63) 



<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 



unsure 
(66) 

unsure 
(81) 

unsure 
(99) 

unsure 
(189) 

unsure 
(225) 

unsure 
(360) 

unsure 
(369) 



4 
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<220> 

<221> unsure 
<222> (390) 



<220> 

<221> unsure 
<222> (446) 



<400> 5 

agcaattctc tccctcncna atgcagnatg 
ggnggnggcg gtttcattgg nacccacacc 
gtttcaataa tcgacaattt cgataactcc 
gtggttggnc ctctgctttc tcaaaacctc 
gatgacttgg agaaactctt ctccaaaaca 
ttgaaaagcg gttgctgaaa gcgttgcgaa 
tgggaccanc aacctctacg agtttatggn 
atcatctgca accgtttatg ggcaanctga 



gggncttcct cccaacanat tctggtcacc 60 
gtcgttcanc ttctcaaagc tggcttcagc 120 
gtcatggaag caatggaccg cgtccgccaa 180 
caattcaccc aaggngatct ccggaatagg 240 
acatttgatg ccgtgatcca ctttgctggc 300 
accccgtcgc tattttgatt ttaatttggn 360 
aaagtataat tgcaaaaaga tgggtttctc 420 
aaaaataccg tgtgaggagg attcaagtt 47 9 



<210> 6 

<211> 148 

<212> PRT 

<213> Glycine max 



<220> 

<221> UNSURE 
<222> (3) 



<220> 

<221> UNSURE 

<222> (7) 

<220> 

<221> UNSURE 

<222> (24) 

<220> 

<221> UNSURE 

<222> (111) 



<220> 

<221> UNSURE 
<222> (114) 



<220> 

<221> UNSURE 

<222> (121) 

<220> 

<221> UNSURE 

<222> (140) 



<400> 6 

Met Gly Xaa Ser Ser Gin Xaa lie 
1 5 

lie Gly Thr His Thr Val Val Xaa 
20 

Ser lie lie Asp Asn Phe Asp Asn 
35 40 



Leu Val Thr Gly Gly Gly Gly Phe 
10 15 

Leu Leu Lys Ala Gly Phe Ser Val 
25 30 

Ser Val Met Glu Ala Met Asp Arg 
45 
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Val Arg Gin Val Val Gly Pro Leu Leu Ser Gin Asn Leu Gin Phe Thr 
50 55 60 

Gin Gly Asp Leu Arg Asn Arg Asp Asp Leu Glu Lys Leu Phe Ser Lys 
65 70 75 80 

Thr Thr Phe Asp Ala Val lie His Phe Ala Gly Leu Lys Ala Val Ala 
85 90 95 

Glu Ser Val Ala Lys Pro Arg Arg Tyr Phe Asp Phe Asn Leu Xaa Gly 
100 105 110 

Thr Xaa Asn Leu Tyr Glu Phe Met Xaa Lys Tyr Asn Cys Lys Lys Met 
115 120 125 

Gly Phe Ser Ser Ser Ala Thr Val Tyr Gly Gin Xaa Glu Lys lie Pro 
130 135 140 

Cys Glu Glu Asp 
145 

<210> 7 
<211> 520 
<212> DNA 

<213> Triticum aestivum 



<220> 
<221> 
<222> 



unsure 
(498) 



<220> 
<221> 
<222> 



unsure 
(508) 



<220> 
<221> 
<222> 



unsure 
(513) 



<220> 
<221> 
<222> 



unsure 
(518) 



<400> 7 

ggagcgtgct 

tggagaaggg 

tcgaccgcgt 

gggacctgac 

tgatacactt 

accgcaacaa 

acaagttggt 

tcgaggactc 

gatctgcgcg 



ggtgaccggc 
ctacgccgtc 
ccgccacatc 
gatcaaggat 
cgccgggctc 
catcgtcggc 
gttctcgtcg 
ccccctcaag 
actaccanca 



ggcgcggggt 
accgccgtcg 
gtcggccccg 
gacctggaga 
aaggcggtgg 
accgtcaacc 
tcggcgaccg 
gccctcaacc 
cgccgaancc 



tcatcggcac 
acaacttcca 
ccctctccgc 
aggtcttcgc 
cggagagcgt 
tctacgacgt 
tgtacggcca 
cgtacggcag 
ggngtggngg 



gcacaccgtg 
caactccgtc 
ccgcctccaa 
cgccaagaag 
ggcgcacccg 
catgaagaag 
gccggagaag 
gaccaagctg 



ctgcagctgc 
cccgaggcgc 
tt cat ctt eg 
tacgacgccg 
gagatgtaca 
caegggtgea 
gtgccctgct 
tactggagga 



60 
120 
180 
240 
300 
360 
420 
480 
520 



<210> 8 

<211> 162 

<212> PRT 

<213> Triticum aestivum 
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<400> 8 

Val Leu Val Thr Gly Gly Ala Gly Phe lie Gly Thr His Thr Val Leu 
15 10 15 

Gin Leu Leu Glu Lys Gly Tyr Ala Val Thr Ala Val Asp Asn Phe His 
20 25 30 

Asn Ser Val Pro Glu Ala Leu Asp Arg Val Arg His lie Val Gly Pro 
35 40 45 

Ala Leu Ser Ala Arg Leu Gin Phe lie Phe Gly Asp Leu Thr He Lys 
50 55 60 

Asp Asp Leu Glu Lys Val Phe Ala Ala Lys Lys Tyr Asp Ala Val He 
65 70 75 80 

His Phe Ala Gly Leu Lys Ala Val Ala Glu Ser Val Ala His Pro Glu 
85 90 95 

Met Tyr Asn Arg Asn Asn lie Val Gly Thr Val Asn Leu Tyr Asp Val 
100 105 110 

Met Lys Lys His Gly Cys Asn Lys Leu Val Phe Ser Ser Ser Ala Thr 
115 120 125 

Val Tyr Gly Gin Pro Glu Lys Val Pro Cys Phe Glu Asp Ser Pro Leu 
130 135 140 

Lys Ala Leu Asn Pro Tyr Gly Arg Thr Lys Leu Tyr Trp Arg Arg Ser 
145 150 155 160 

Ala Arg 



<210> 9 

<211> 594 

<212> DNA 

<213> Zea mays 

<220> 

<221> unsure 

<222> (340) 

<220> 

<221> unsure 

<222> (345) 

<220> 

<221> unsure 

<222> (399) 

<220> 

<221> unsure 

<222> (407) 

<220> 

<221> unsure 

<222> (435) 
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<220> 

<221> unsure 

<222> (460) 

<220> 

<221> unsure 

<222> (523) 

<220> 

<221> unsure 

<222> (531) 



<220> 

<221> unsure 
<222> (568) 



<220> 

<221> unsure 
<222> (587) 



<220> 

<221> unsure 
<222> (592) 



<400> 9 

ggacgccgtc atccacttcg ctgggctgaa 
gatgtactac gagaacaacc tcatcggcac 
cggctgcaag aagctggttt tctcgtcatc 
cccatgcgtc gaggactcca agctgcaggc 
ccttgaggat atggcgcgtg actaccaccg 
gcgctacttc aaccccatcg gtgcgcacag 
ggataccgaa caacctgctg ccctacatcc 
tcaacgtcta cgggncacga ttaccccacc 
acacgtcgtc gaactcgccg atgggcacat 
tgatataagt tgtgtgggct acaatctngg 



ggccgtgggg gaaagcgtcg cgcacccgga 60 
catcaacctc tacaagagca tgaaggagca 120 
cgccaccgtg tacggctggc cggaggtgat 180 
cgccaaccca tacggcagga ccaagcttat 240 
cgcggacacg gagtggagca tcgtcctgct 300 
ctccggcgan atcgngagag gaccccaagg 360 
agcaggtcnc cgtcggnagg ctccccgagc 420 
cggggacggn accgcgatca gggactacat 480 
cgcaagggct cangaactct ncgactctcc 540 
ggtacaaggg cggcggnaca tncg 594 



<210> 10 

<211> 197 

<212> PRT 

<213> Zea mays 



<220> 

<221> UNSURE 
<222> (113) 



<220> 

<221> UNSURE 
<222> (115) 



<220> 

<221> UNSURE 

<222> (133) 

<220> 

<2 21> UNSURE 

<222> (136) 



<220> 

<221> UNSURE 
<222> (145) 
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<220> 

<221> UNSURE 
<222> (174) 

<220> 

<221> UNSURE 
<222> (177) 

<220> 

<221> UNSURE 
<222> (196) . . (197) 

<400> 10 

Asp Ala Val He His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Val 
15 10 15 

Ala His Pro Glu Met Tyr Tyr Glu Asn Asn Leu He Gly Thr He Asn 
20 25 30 

Leu Tyr Lys Ser Met Lys Glu His Gly Cys Lys Lys Leu Val Phe Ser 
35 40 45 

Ser Ser Ala Thr Val Tyr Gly Trp Pro Glu Val He Pro Cys Val Glu 
50 55 60 

Asp Ser Lys Leu Gin Ala Ala Asn Pro Tyr Gly Arg Thr Lys Leu He 
65 70 75 80 

Leu Glu Asp Met Ala Arg Asp Tyr His Arg Ala Asp Thr Glu Trp Ser 
85 90 95 

lie Val Leu Leu Arg Tyr Phe Asn Pro He Gly Ala His Ser Ser Gly 
100 105 110 

Xaa He Xaa Arg Gly Pro Gin Gly Asp Thr Glu Gin Pro Ala Ala Leu 
115 120 125 

His Pro Ala Gly Xaa Arg Arg Xaa Ala Pro Arg Ala Gin Arg Leu Arg 
130 135 140 

Xaa Thr He Thr Pro Pro Gly Asp Gly Thr Ala He Arg Asp Tyr He 
145 150 155 160 

His Val Val Glu Leu Ala Asp Gly His He Ala Arg Ala Xaa Glu Leu 
165 170 175 

Xaa Asp Ser Pro Asp He Ser Cys Val Gly Tyr Asn Leu Gly Val Gin 
180 185 190 

Gly Arg Arg Xaa Xaa 
195 

<210> 11 

<211> 300 

<212> DNA 

<213> Oryza sativa 



9 
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<220> 

<221> unsure 
<222> (18) 

<220> 

<221> unsure 
<222> (41) 

<220> 

<221> unsure 
<222> (85) 

<220> 

<221> unsure 
<222> (154) 

<220> 

<221> unsure 
<222> (177) 

<220> 

<221> unsure 
<222> (180) 

<220> 

<221> unsure 
<222> (184) 

<220> 

<221> unsure 
<222> (202) 

<220> 

<221> unsure 
<222> (209) 

<220> 

<221> unsure 
<222> (214) 

<220> 

<221> unsure 
<222> (227) 

<220> 

<221> unsure 
<222> (247) 

<220> 

<221> unsure 
<222> (252) 

<220> 

<221> unsure 
<222> (283) 

<220> 

<221> unsure 
<222> (285) 
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<400> 11 

aggacttaaa agacaggnac aactggaata agtgttacgc ngccaagagg tatgacgccg 60 
tgatccactt cgccgggctg aagcngtggg ggagagcgtc gcgcaacccg cagatgtact 120 
acgaggacaa cgtcgccggc accatgaacc tctnctccgc cttgaccaag tacggcngcn 180 
agangatagt gttctcgtcg tnggcgacng tgtncggcca gccgganaag accccctgcg 240 
tcgaggnttc cnagctgagc gctctcaacc catacggcgc cancnggctc gtcctggaga 300 

<210> 12 

<211> 99 

<212> PRT 

<213> Oryza sativa 

<220> 

<221> UNSURE 
<222> (6) 

<220> 

<221> UNSURE 
<222> (28) 

<220> 

<221> UNSURE 
<222> (51) 

<220> 

<221> UNSURE 

<222> (59) . . (60) . . (61) 

<220> 

<221> UNSURE 
<222> (67) 

<220> 

<221> UNSURE 
<222> (71) 

<220> 

<221> UNSURE 
<222> (75) 

<220> 

<221> UNSURE 
<222> (82) 

<220> 

<221> UNSURE 
<222> (84) 

<220> 

<221> UNSURE 
<222> (94) . . (95) 

<400> 12 

Asp Leu Lys Asp Arg Xaa Asn Trp Asn Lys Cys Tyr Ala Ala Lys Arg 
15 10 15 

Tyr Asp Ala Val lie His Phe Ala Gly Leu Lys Xaa Trp Gly Arg Ala 
20 25 30 
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Ser Arg Asn Pro Gin Met Tyr Tyr Glu Asp Asn Val Ala Gly Thr Met 
35 40 45 

Asn Leu Xaa Ser Ala Leu Thr Lys Tyr Gly Xaa Xaa Xaa He Val Phe 
50 55 60 

Ser Ser Xaa Ala Thr Val Xaa Gly Gin Pro Xaa Lys Thr Pro Cys Val 
65 70 75 80 

Glu Xaa Ser Xaa Leu Ser Ala Leu Asn Pro Tyr Gly Ala Xaa Xaa Leu 
85 90 95 

Val Leu Glu 



<210> 13 

<211> 1312 

<212> DNA 

<213> Glycine max 



<400> 13 

gcacgagcca 

gtcaccggtg 

ttcagcgttt 

cgccaagtgg 

aatagggatg 

gctggcttga 

ttggttggca 

ttctcatcat 

aagttacaag 

gatattcaga 

ggggctcatg 

ccttacattc 

tatccaacga 

ggccatattg 

aacctgggaa 

tctggcaaga 

tatgcatcta 

gagatgtgca 

ccttgaatta 

tctcttatat 

tgacaataaa 

taaattagac 



cttctctccc 
gtgccggttt 
caataatcga 
ttggccctct 
acttggagaa 
aagcggttgc 
ccatcaacct 
ctgcaaccgt 
ctatgaatcc 
aagctgaacc 
aaagtggcaa 
agcaagtagc 
gggatggctc 
ctgccctgcg 
ctggtcgtgg 
aaattccagt 
cagagagagc 
gggaccaatg 
gcttgagaaa 
agaatacttt 
aatt tggcag 
tggccattga 



tctctattgc 
cattggcacc 
caatttcgat 
gctttctcag 
actcttctcc 
tgaaagcgtt 
ctacgagttt 
ttatggccaa 
ctatggacgg 
agaatggaag 
actcggtgaa 
tgttggaaga 
tgcgatccgg 
aaagctcttc 
aacatctgtg 
aaaattatgt 
tgagaaagaa 
gaattgggca 
tatactgctc 
tatgtttgat 
catttcaaga 
tttgatattg 



agcatggtgt 
cacaccgtcg 
aactccgtca 
aacctccaat 
aaaacaacat 
gcgaagcccc 
atggcaaagt 
cctgaaaaga 
accaagcttt 
atcatattac 
gatcccaagg 
ttgactgaac 
gactatatcc 
acaacggaga 
cttgaaatgg 
ccaagaagac 
cttggttgga 
aagaacaatc 
atctacgaat 
gatttgttta 
agttaaagct 
aaaaaaaaaa 



cttcctccca 
ttcagcttct 
tggaagcagt 
tcacccaggg 
ttgatgccgt 
gtcgctattt 
ataattgcaa 
taccgtgtga 
tcctggaaga 
tgagatactt 
gcatcccaaa 
tcaatgtata 
atgtgatgga 
acataggttg 
ttacagcatt 
cgggagatgc 
aggcaaacta 
cctggggtta 
gcttttcaca 
ggcagttcgt 
atgtattt aa 
aaaaaaaaaa 



acacattctg 
caaagctggc 
ggaccgcgtc 
cgatctccgg 
gatccacttt 
tgattttaat 
aaagatggtt 
ggaggatttc 
aattgcccga 
caatccagtt 
taacctcatg 
cggtcatgat 
cttggcagat 
tactgcttac 
tgaaaaggct 
gactgaggtt 
tggtgtggag 
cgcggggaag 
taaataggca 
tgtataatct 
acaataactt 
aa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1312 



<210> 14 

<211> 350 

<212> PRT 

<213> Glycine max 

<400> 14 

Met Val Ser Ser Ser Gin His He Leu Val Thr Gly Gly Ala Gly Phe 
15 10 15 

He Gly Thr His Thr Val Val Gin Leu Leu Lys Ala Gly Phe Ser Val 
20 25 30 



Ser lie He Asp Asn Phe Asp Asn Ser Val Met Glu Ala Val Asp Arg 
35 40 45 
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Val Arg Gin Val Val Gly Pro Leu Leu Ser Gin Asn Leu Gin Phe Thr 
50 55 60 

Gin Gly Asp Leu Arg Asn Arg Asp Asp Leu Glu Lys Leu Phe Ser Lys 
65 70 75 80 

Thr Thr Phe Asp Ala Val lie His Phe Ala Gly Leu Lys Ala Val Ala 
85 90 95 

Glu Ser Val Ala Lys Pro Arg Arg Tyr Phe Asp Phe Asn Leu Val Gly 
100 105 110 

Thr lie Asn Leu Tyr Glu Phe Met Ala Lys Tyr Asn Cys Lys Lys Met 
115 120 125 

Val Phe Ser Ser Ser Ala Thr Val Tyr Gly Gin Pro Glu Lys lie Pro 
130 135 140 

Cys Glu Glu Asp Phe Lys Leu Gin Ala Met Asn Pro Tyr Gly Arg Thr 
145 150 155 160 

Lys Leu Phe Leu Glu Glu lie Ala Arg Asp lie Gin Lys Ala Glu Pro 
165 170 175 

Glu Trp Lys lie lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His 
180 185 190 

Glu Ser Gly Lys Leu Gly Glu Asp Pro Lys Gly lie Pro Asn Asn Leu 
195 200 205 

Met Pro Tyr lie Gin Gin Val Ala Val Gly Arg Leu Thr Glu Leu Asn 
210 215 220 

Val Tyr Gly His Asp Tyr Pro Thr Arg Asp Gly Ser Ala lie Arg Asp 
225 230 235 240 

Tyr lie His Val Met Asp Leu Ala Asp Gly His lie Ala Ala Leu Arg 
245 250 255 

Lys Leu Phe Thr Thr Glu Asn lie Gly Cys Thr Ala Tyr Asn Leu Gly 
260 265 270 

Thr Gly Arg Gly Thr Ser Val Leu Glu Met Val Thr Ala Phe Glu Lys 
275 280 285 

Ala Ser Gly Lys Lys lie Pro Val Lys Leu Cys Pro Arg Arg Pro Gly 
290 295 300 

Asp Ala Thr Glu Val Tyr Ala Ser Thr Glu Arg Ala Glu Lys Glu Leu 
305 310 315 320 

Gly Trp Lys Ala Asn Tyr Gly Val Glu Glu Met Cys Arg Asp Gin Trp 

325 330 335 

Asn Trp Ala Lys Asn Asn Pro Trp Gly Tyr Ala Gly Lys Pro 
340 345 350 

<210> 15 
<211> 1516 
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<212> DNA 

<213> Triticum aestivum 



<220> 

<221> unsure 
<222> (1458) . 



(1459) 



<400> 15 

gcacgaggga 

cagctgctgg 

gaggcgctcg 

atcttcgggg 

gacgccgtga 

atgtacaacc 

gggtgcaaca 

ccctgcttcg 

ctggaggaga 

cgctacttca 

gtccccaaca 

aacgtctacg 

gtggtcgacc 

atcggctgtg 

agcgcgttcg 

ggcgattcgg 

gccaagtacg 

tatggctact 

ttgtaacgtg 

tgtttcctgt 

actgggccgg 

tgcttccttt 

aaaaaaaaaa 

agaaggaaag 

cgcgacggag 

ggggcctcaa 



gcgtgctggt 
agaagggcta 
accgcgtccg 
acctgacgat 
tacacttcgc 
gcaacaacat 
agttggtgtt 
aggactcccc 
tgctgcgcga 
accccatcgg 
acctgctccc 
gccacgacta 
tcgccgacgg 
tggcgtacaa 
agaaggcata 
agcaggtgta 
ggatcgagga 
gcggcaacgc 
aaagaaaaga 
tgtggtacta 
gctgttcctt 
tttttcaaat 
aaagaaaaaa 
agaaaaagaa 
aaagcgcnnt 
tccccg 



gaccggcggc 
cgccgtcacc 
ccacatcgtc 
caaggatgac 
cgggctcaag 
cgtcggcacc 
ctcgtcgtcg 
cctcaaggcc 
ctaccagcac 
cgcacacgag 
ctacatccag 
ccgcacccqc 
ccacatcgcg 
cctggggacg 
cggcaagaaa 
cgcgtccacc 
gatgtgcagg 
tgctgagaac 
tgtgtcaata 
ttcgtaagtt 
ggtgaagaat 
gatagagcga 
aaaaagacaa 
ggcgcaaagc 
ttcaggccgg 



gcggggttca 
gccgtcgaca 
ggccccgccc 
ctggagaagg 
gcggtggcgg 
gtcaacctct 
gcgaccgtgt 
ctcaacccgt 
gcgaacccgg 
agcggcgaca 
caggtggccg 
gacggcaccg 
gcgctcgaga 
gggcgcggga 
atcccggtga 
gccaaggccg 
gaccagtgga 
aaagactgat 
agcccagggc 
ggaacttgag 
ttggtctggt 
gaccgatgag 
aaaaaaaacg 
ggggccccgc 
gggggggggg 



tcggcacgca 
acttccacaa 
tctccgcccg 
tcttcgccgc 
agagcgtggc 
acgacgtcat 
acggccagcc 
acggcaggac 
agtggaggac 
tcggggagga 
tcgcccgccg 
ccgtcaggga 
agctcttcgc 
cgacggtgct 
agatgtgccc 
aagaggagct 
actgggccaa 
tcggtggccc 
attaaagtgt 
ttgggttaga 
tt cgaacatg 
gcaaaaaaaa 
agaaaggaga 
cgaacggacc 
ggaaccccgt 



caccgtgctg 
ctccgtcccc 
cctccaattc 
caagaagtac 
gcacccggag 
gaagaagcac 
ggagaaggtg 
caagctgtac 
gatcctgctg 
ccccaagggc 
ccccgagctc 
ctacatccac 
cacccctgac 
ggagatggtg 
caggaggccg 
cggctggagg 
gaagaacccg 
gtcgcgagcc 
gcccagaaaa 
ctggactgtc 
ggccgtcatc 
aaaaaaaaaa 
aaaaaatgac 
gacggcgcgc 
ttccctaagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1516 



<210> 16 
<211> 352 
<212> PRT 

<213> Triticum aestivum 
<400> 16 

Ala Arg Gly Ser Val Leu Val Thr Gly Gly Ala Gly Phe He Gly Thr 
15 10 15 

His Thr Val Leu Gin Leu Leu Glu Lys Gly Tyr Ala Val Thr Ala Val 
20 25 30 

Asp Asn Phe His Asn Ser Val Pro Glu Ala Leu Asp Arg Val Arg His 
35 40 45 

He Val Gly Pro Ala Leu Ser Ala Arg Leu Gin Phe He Phe Gly Asp 
50 55 60 

Leu Thr He Lys Asp Asp Leu Glu Lys Val Phe Ala Ala Lys Lys Tyr 
65 70 75 80 



Asp Ala Val lie His Phe Ala Gly Leu Lys Ala Val Ala Glu Ser Val 
85 90 95 
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Ala His Pro Glu Met Tyr Asn Arg Asn Asn lie Val Gly Thr Val Asn 
100 105 110 

Leu Tyr Asp Val Met Lys Lys His Gly Cys Asn Lys Leu Val Phe Ser 
115 120 125 

Ser Ser Ala Thr Val Tyr Gly Gin Pro Glu Lys Val Pro Cys Phe Glu 
130 135 140 

Asp Ser Pro Leu Lys Ala Leu Asn Pro Tyr Gly Arg Thr Lys Leu Tyr 
145 150 155 160 

Leu Glu Glu Met Leu Arg Asp Tyr Gin His Ala Asn Pro Glu Trp Arg 
165 170 175 

Thr lie Leu Leu Arg Tyr Phe Asn Pro lie Gly Ala His Glu Ser Gly 
180 185 190 

Asp lie Gly Glu Asp Pro Lys Gly Val Pro Asn Asn Leu Leu Pro Tyr 
195 200 205 

lie Gin Gin Val Ala Val Ala Arg Arg Pro Glu Leu Asn Val Tyr Gly 
210 215 220 

His Asp Tyr Arg Thr Arg Asp Gly Thr Ala Val Arg Asp Tyr lie His 
225 230 235 240 

Val Val Asp Leu Ala Asp Gly His lie Ala Ala Leu Glu Lys Leu Phe 
245 250 255 

Ala Thr Pro Asp lie Gly Cys Val Ala Tyr Asn Leu Gly Thr Gly Arg 
260 265 270 

Gly Thr Thr Val Leu Glu Met Val Ser Ala Phe Glu Lys Ala Tyr Gly 
275 280 285 

Lys Lys lie Pro Val Lys Met Cys Pro Arg Arg Pro Gly Asp Ser Glu 
290 295 300 

Gin Val Tyr Ala Ser Thr Ala Lys Ala Glu Glu Glu Leu Gly Trp Arg 
305 310 315 320 

Ala Lys Tyr Gly lie Glu Glu Met Cys Arg Asp Gin Trp Asn Trp Ala 
325 330 335 

Lys Lys Asn Pro Tyr Gly Tyr Cys Gly Asn Ala Ala Glu Asn Lys Asp 
340 345 350 

<210> 17 
<211> 1393 
<212> DNA 
<213> Zea mays 

<400> 17 

gcacgaggat tgatctccgt gacaagggag cactggaaat ggtttttgct tctacaagat 60 

ttgaagctgt cattcacttc gctggattga aagctgtggg tgaaagcgta cagaagccat 120 

tactttatta tgacaacaac gtcattggca cgataaatct tctagaagtt atgtctgttc 180 

acggttgcaa gaagttggtg ttctcatcat cagctgcagt ttatggatca cccaaaaact 240 

caccctgcac agaaaatttt cctcttactc caaacaatcc atatggcaaa acaaagctcg 300 

ttgttgaaga tatttgccgg gatatctacc gttcagatcc tgaatggaag atcattttac 360 
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ttaggtactt caatccagtt ggtgctcatc 
gaattcccaa caatcttatg ccctatgttc 
taacagtttt aggaaatgac tatgcaacaa 
atgtggttga ccttgctgac ggacatattg 
gcatagggtg tgaagcgtac aaccttggaa 
ttaaagcatt tgagaaggct tctgggaaga 
caggtgatgc agagattctg ttttcagaga 
aagcaaaata cggtattgaa gagatgtgcc 
cttatggcta tggatcacct gactctatca 
ctgactcctc caagcagaat ggccaccgca 
acggccacca tgcgtatggg tctgctgact 
gatcatcaga cctcaagccg aatggtaatg 
tgtgagctcc ctgtacattc ggttgcgatg 
tcgtgaaccc actatccgag attgatgtgg 
agccggtggc agaggaattc ctgtttgctg 
ttttcttgct tctttcgatt atggtgtagg 
ctttccctgc taattggact ttattacgct 
aaaaaaaaaa aaa 



ctagtggata tcttggcgag gacccacgag 420 
agcaagttgc ggttggtagg aggccagctc 480 
gagatgggac tggggtccga gattacatcc 540 
ctgcattgca gaagcttttt gagaactcta 600 
ccggaagagg tacatctgtg ctggagattg 660 
aaatacctct gatttttggt gaaagacgcc 720 
ctactaaagc agagagggag cttaactgga 780 
gcgaccaatg gaactgggcc agcaagaacc 84 0 
agcagaatgg tcaccaaaca aacggatccg 900 
caaacggttc aactgactca cccaagcgga 960 
cacccaagcg caacgggcac tgcgtttttg 1020 
gccacctgcg ctgagcagaa ctgtttggcc 1080 
tgagctccct gcacgttcgg tcgaggtcta 1140 
atcattgggt tgacaggtca tacagtatag 1200 
tgggtaaagc ttatcttctg ctttcgtgtt 1260 
aatgtggtca taatgtatta gctgattatc 1320 
tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 

1393 



<210> 18 

<211> 353 

<212> PRT 

<213> Zea mays 



<400> 18 

Thr Arg lie Asp Leu Arg Asp Lys Gly Ala Leu Glu Met Val Phe Ala 
15 10 15 

Ser Thr Arg Phe Glu Ala Val lie His Phe Ala Gly Leu Lys Ala Val 

20 25 30 

Gly Glu Ser Val Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Val lie 
35 40 45 

Gly Thr He Asn Leu Leu Glu Val Met Ser Val His Gly Cys Lys Lys 
50 55 60 

Leu Val Phe Ser Ser Ser Ala Ala Val Tyr Gly Ser Pro Lys Asn Ser 
65 70 75 80 

Pro Cys Thr Glu Asn Phe Pro Leu Thr Pro Asn Asn Pro Tyr Gly Lys 
85 90 95 

Thr Lys Leu Val Val Glu Asp He Cys Arg Asp lie Tyr Arg Ser Asp 
100 105 110 

Pro Glu Trp Lys He lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala 
115 120 125 

His Pro Ser Gly Tyr Leu Gly Glu Asp Pro Arg Gly lie Pro Asn Asn 
130 135 140 

Leu Met Pro Tyr Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu 
145 150 155 160 

Thr Val Leu Gly Asn Asp Tyr Ala Thr Arg Asp Gly Thr Gly Val Arg 
165 170 175 

Asp Tyr lie His Val Val Asp Leu Ala Asp Gly His He Ala Ala Leu 
180 185 190 



16 



WO 00/47755 



PCT/US00/03453 



Gin Lys Leu Phe Glu 
195 

Gly Thr Gly Arg Gly 
210 

Lys Ala Ser Gly Lys 
225 

Gly Asp Ala Glu lie 
245 

Leu Asn Trp Lys Ala 
260 

Trp Asn Trp Ala Ser 
275 

lie Lys Gin Asn Gly 
290 

Gin Asn Gly His Arg 
305 

Gly His His Ala Tyr 

325 

Cys Val Phe Gly Ser 
340 



Asn Ser Ser lie Gly Cys 
200 

Thr Ser Val Leu Glu lie 
215 

Lys lie Pro Leu lie Phe 
230 235 

Leu Phe Ser Glu Thr Thr 
250 

Lys Tyr Gly lie Glu Glu 

265 

Lys Asn Pro Tyr Gly Tyr 
280 

His Gin Thr Asn Gly Ser 
295 

Thr Asn Gly Ser Thr Asp 
310 315 

Gly Ser Ala Asp Ser Pro 
330 

Ser Asp Leu Lys Pro Asn 
345 



Glu Ala Tyr Asn Leu 
205 

Val Lys Ala Phe Glu 
220 

Gly Glu Arg Arg Pro 
240 

Lys Ala Glu Arg Glu 
255 

Met Cys Arg Asp Gin 

270 

Gly Ser Pro Asp Ser 
285 

Ala Asp Ser Ser Lys 
300 

Ser Pro Lys Arg Asn 
320 

Lys Arg Asn Gly His 
335 

Gly Asn Gly His Leu 
350 



Arg 



<210> 19 

<211> 1498 

<212> DNA 

<213> Oryza sativa 



<400> 19 

gcacgagatc actcttcttc ttccgctctc 
cacacacgca cacaacaaca acaagagtaa 
ttgcggacga tcctggtgac gggcggcgcc 
cttctccaac tcggcttccg cgttgtcgtc 
gccatcctcc gcgtcaggga actcgccgga 
gttgacctcc gcgacaagca agcgttggac 
gtcatccatt ttgccgggct gaaagctgtt 
tacgacaaca acctcatcgg caccatcact 
accaagctgg tgttctcatc atccgcaact 
actgaagaat ccccactttg tgcaatgaac 
gacatgtgcc gggatctgca tgcctcagac 
ttcaaccctg ttggagctca cccaagcggg 
aacaacctca tgcccttcgt ccagcaggtc 
tatggaaccg actacaacac caaggatgga 
gatctagcgg atggtcatat cgccgcgtta 
ggatgtgagg tgtacaatct gggcactgga 
gcattcgaga aagcttctgg aaagaaaatc 
gatgccgaga tcgtttacgc tcaaactgcc 
aaatacgggg tagaggagat gtgcagggac 
gggtatggat cgccggacag tagcaactga 



tagctttgct ttgcttgctt catcaaaccc 60 

tcaaagtaga agaagatggt ttcggccttg 120 

ggctacatcg gcagccacac cgtcctccag 180 

ctcgacaacc tcgacaacgc ctccgagctc 240 

cacaacgcca acaacctcga cttccgcaag 300 

caaatcttct cctctcaaag gtttgaggct 360 

ggcgagagcg tgcagaagcc cctgctttac 420 

ctcctgcagg tcatggccgc acatggctgc 4 80 

gtctacgggt ggcccaagga ggtgccctgc 54 0 

ccctacggca gaacaaagct ggtaatcgaa 600 

ccaaactgga agatcatact gctccgatac 660 

tacattggtg aggacccctg cggcatccca 720 

gctgttggca ggaggccggc ccttaccgtc 780 

actggggttc gtgactatat ccatgttgtt 840 

aggaagctct atgaagattc tgatagaata 900 

aaggggacat ctgtgctgga aatggttgca 960 

ccgcttgtat ttgctggacg aaggcctgga 1020 

aaagctgaga aggaactgaa atggaaggca 1080 

ctgtggaatt gggcgagcaa gaacccctac 114 0 

tccagctgaa tataggcgtc caatcctcca 1200 
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gtagcagcag cagcagcatg acttctatac atatatatat ataatcataa agaatgaaga 1260 

aacaaagaat tcggacttgt tgagttacta ctactactac tactaatccc atctgatgga 1320 

ccgcattgta tagggggctt gtaggggtcc agcagcttca tcatcagtct ccttaggagg 1380 

cctctaatat aatctccata tttatggtag aaataaattt tgcccaccgt ggaagaacta 1440 

tataatagaa tcatgatgat ttgttgatta aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1498 

<210> 20 

<211> 354 

<212> PRT 

<213> Oryza sativa 

<400> 20 

Met Val Ser Ala Leu Leu Arg Thr lie Leu Val Thr Gly Gly Ala Gly 
15 10 15 

Tyr lie Gly Ser His Thr Val Leu Gin Leu Leu Gin Leu Gly Phe Arg 
20 25 30 

Val Val Val Leu Asp Asn Leu Asp Asn Ala Ser Glu Leu Ala lie Leu 
35 40 45 

Arg Val Arg Glu Leu Ala Gly His Asn Ala Asn Asn Leu Asp Phe Arg 
50 55 60 

Lys Val Asp Leu Arg Asp Lys Gin Ala Leu Asp Gin lie Phe Ser Ser 
65 70 75 80 

Gin Arg Phe Glu Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly 
85 90 95 

Glu Ser Val Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Leu lie Gly 
100 105 110 

Thr lie Thr Leu Leu Gin Val Met Ala Ala His Gly Cys Thr Lys Leu 
115 120 125 

Val Phe Ser Ser Ser Ala Thr Val Tyr Gly Trp Pro Lys Glu Val Pro 
130 135 140 

Cys Thr Glu Glu Ser Pro Leu Cys Ala Met Asn Pro Tyr Gly Arg Thr 
145 150 155 160 

Lys Leu Val lie Glu Asp Met Cys Arg Asp Leu His Ala Ser Asp Pro 
165 170 175 

Asn Trp Lys lie lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His 
180 185 190 

Pro Ser Gly Tyr lie Gly Glu Asp Pro Cys Gly lie Pro Asn Asn Leu 
195 200 205 

Met Pro Phe Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu Thr 
210 215 220 

Val Tyr Gly Thr Asp Tyr Asn Thr Lys Asp Gly Thr Gly Val Arg Asp 
225 230 235 240 

Tyr lie His Val Val Asp Leu Ala Asp Gly His lie Ala Ala Leu Arg 
245 250 255 
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Lys Leu Tyr Glu 
260 

Gly Thr Gly Lys 
275 

Lys Ala Ser Gly 
290 

Gly Asp Ala Glu 
305 

Leu Lys Trp Lys 



Trp Asn Trp Ala 
340 



Ser Asn 



Asp Ser Asp Arg 



Gly Thr Ser Val 
280 

Lys Lys lie Pro 

295 

He Val Tyr Ala 
310 

Ala Lys Tyr Gly 
325 

Ser Lys Asn Pro 



He Gly Cys Glu 
265 

Leu Glu Met Val 



Leu Val Phe Ala 
300 

Gin Thr Ala Lys 
315 

Val Glu Glu Met 
330 

Tyr Gly Tyr Gly 
345 



Val Tyr Asn Leu 
270 

Ala Ala Phe Glu 

285 

Gly Arg Arg Pro 



Ala Glu Lys Glu 
320 

Cys Arg Asp Leu 

335 

Ser Pro Asp Ser 
350 



<210> 21 

<211> 1532 

<212> DNA 

<213> Glycine max 

<400> 21 

gaattcggca cgagcgcaaa ctttcttcca aacgaacgtg tcacaaaatt ctcgccttct 60 

ccgaatatgg catcgcgcgt cagcattggc aaccttacct cctccgcgcc gtatattaat 120 

tcccctcact ttcgctcacc acttaagatt tccaacaacc cctctctgca aaacgcttcg 180 

cataaggtac ttatgcgcga taagactgta ctggtaaccg gcggagccgg ttacatcggc 240 

agccacaccg ttcttcagct cttgctcgga ggtttcagag ccgtcgtcct cgacaacctc 300 

gaaaattcct ccgaggttgc catccacaga gtcagggagc tcgccggcga atttgggaac 360 

aacctctcct ttcacaaggt ggacctacgg gacagagctg ctctagacca aatattttct 420 

tccacacaat tcgatgctgt catacatttt gctggactga aagcagtagg agaaagtgtg 480 

caaaaacctt tactatacta taacaacaac ttgactggga caatcactct attggaagtc 540 

atggctgccc atggatgcaa gaagctcgtg ttttcatctt cagcaactgt atatggttgg 600 

ccaaaggagg ttccatgcac agaagagttc cctctgtcag caatgaaccc atatggacga 660 

actaagctta tcattgaaga aatttgccgt gatgtccact gtgcagagcc agattgtaaa 720 

ataattttgt taagatactt caacccagtt ggtgcacacc ccagtggtta tattggggag 780 

gatcctcgtg gaattccaaa caatctcatg ccatttgttc agcaagtagc agttggccga 84 0 

cggcctgcac tgacagtttt tggaaatgat tataatacaa gtgatggcac tggggttcgg 900 

gattacattc atgttgttga tttagcagat gggcacattg ctgcattgct taaactagat 960 

gaacctaata taggttgtga ggtttataac ctgggaacag gaaagggaac atcagttttg 1020 

gagatggtta gagcttttga aatggcatct ggaaagaaaa ttccacttgt gatggctggc 1080 

cgtagacctg gtgatgctga aattgtttat gcatcaacaa agaaagcgga aagagagctt 1140 

aaatggaagg caaaatatgg cattgatgag atgtgccgtg atcaatggaa ttgggctagc 1200 

aaaaaccctt atggctatgg agatcagggc tccaccgatt aaccacttag ttttctcttt 1260 

gggttctttt ctgaactcac ccacaccgta gtccgtaggt cttgtgaatt tagttttccc 1320 

aaaagctttt ctttctttag tgatcttaag gtgacaaagt acttgtatta ttactattca 1380 

tagttacata gtaagtaagt agtggtttac tatactgtaa tttaaaggtt ctctaggttc 1440 

cttcttacag gttattgatt attagattcg gattctctca tgttccacat gagcagcatc 1500 

ctgttttgta aatctaaatc acatgtttgt tt 1532 

<210> 22 

<211> 349 

<212> PRT 

<213> Glycine max 
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<400> 22 

Met Arg Asp Lys Thr Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly 
15 10 15 

Ser His Thr Val Leu Gin Leu Leu Leu Gly Gly Phe Arg Ala Val Val 
20 25 30 

Leu Asp Asn Leu Glu Asn Ser Ser Glu Val Ala He His Arg Val Arg 
35 40 45 

Glu Leu Ala Gly Glu Phe Gly Asn Asn Leu Ser Phe His Lys Val Asp 
50 55 60 • 

Leu Arq Asp Arg Ala Ala Leu Asp Gin He Phe Ser Ser Thr Gin Phe 
65 70 75 80 

Asp Ala Val He His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Val 
85 90 95 

Gin Lys Pro Leu Leu Tyr Tyr Asn Asn Asn Leu Thr Gly Thr He Thr 
100 105 110 

Leu Leu Glu Val Met Ala Ala His Gly Cys Lys Lys Leu Val Phe Ser 
115 120 125 

Ser Ser Ala Thr Val Tyr Gly Trp Pro Lys Glu Val Pro Cys Thr Glu 
130 135 - 140 

Glu Phe Pro Leu Ser Ala Met Asn Pro Tyr Gly Arg Thr Lys Leu He 
145 150 155 160 

He Glu Glu lie Cys Arg Asp Val His Cys Ala Glu Pro Asp Cys Lys 
165 170 175 

He He Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly 
180 185 190 

Tyr He Gly Glu Asp Pro Arg Gly He Pro Asn Asn Leu Met Pro Phe 
195 200 205 

Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu Thr Val Phe Gly 
210 215 220 

Asn Asp Tyr Asn Thr Ser Asp Gly Thr Gly Val Arg Asp Tyr He His 
225 230 235 240 

Val Val Asp Leu Ala Asp Gly His He Ala Ala Leu Leu Lys Leu Asp 
245 250 255 

Glu Pro Asn He Gly Cys Glu Val Tyr Asn Leu Gly Thr Gly Lys Gly 
260 265 270 

Thr Ser Val Leu Glu Met Val Arg Ala Phe Glu Met Ala Ser Gly Lys 
275 280 285 

Lys He Pro Leu Val Met Ala Gly Arg Arg Pro Gly Asp Ala Glu He 
290 295 300 

Val Tyr Ala Ser Thr Lys Lys Ala Glu Arg Glu Leu Lys Trp Lys Ala 
305 310 315 320 
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Lvs Tyr Gly He Asp Glu Met Cys Arg Asp Gin Trp Asn Trp Ala Ser 

325 330 335 

Lys Asn Pro Tyr Gly Tyr Gly Asp Gin Gly Ser Thr Asp 
340 345 

<210> 23 
<211> 490 
<212> DNA 

<213> Triticum aestivum 
<220> 

<221> unsure 
<222> (61) 

<220> 

<221> unsure 
<222> (73) 

<220> 

<221> unsure 
<222> (81) 

<220> 

<221> unsure 
<222> (207) 

<220> 

<221> unsure 
<222> (246) 

<220> 

<221> unsure 
<222> (284) 

<220> 

<221> unsure 
<222> (319) 

<220> 

<221> unsure 
<222> (332) 

<220> 

<221> unsure 
<222> (378) . . (379) 

<220> 

<221> unsure 
<222> (409) 

<220> 

<221> unsure 
<222> (413) 

<220> 

<221> unsure 
<222> (418) 
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<220> 

<221> unsure 
<222> (455) 



<220> 

<221> unsure 
<222> (461) 



<220> 

<221> unsure 
<222> (468) 



<220> 

<221> unsure 
<222> (480) 



<220> 

<221> unsure 

<222> (482) 

<400> 23 



aagaaacaag agagcaagga agaagaagat ggtgtctgcg gtgttgagga cgattcctgg 60 

ntgaccggcg gcncggggta natcggcagc cacaccgtgc tgcagctgct cctgcagggc 120 

ttccgcgtcc tcgtagtcga cagcctcgac aacgcctccg aggaggccat ccgccgcgtc 180 

cgacaactcg ccaacgcccc gcaaaanagc ctcgacttcc gcaaggtgga ccttcgtgac 24 0 

aaggangcgc tcgaccaaat cttctcctcc caaaggtatc ttcnactttt ttccgcaaaa 300 

l-attatta anaattaact atacrtatatt attqaqtcca 360 



aagaagtatc ttttttcgng 


caaattaaat gttgattnnt 


tgctgctata gcaaatacga 


cncctacgag 


<210> 


24 


<211> 


103 


<212> 


PRT 


<213> 


Triticum aesti' 


<220> 




<221> 


UNSURE 


<222> 


(4) 


<220> 




<221> 


UNSURE 


<222> 


(6) 


<220> 




<221> 


UNSURE 


<222> 


(48) 


<220> 




<221> 


UNSURE 


<222> 


(61) 


<220> 




<221> 


UNSURE 


<222> 


(74) 


<220> 




<221> 


UNSURE 


<222> 


(86) 



4 90 
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<220> 

<221> UNSURE 
<222> (90) 

<400> 24 

Thr Gly Gly Xaa Gly Xaa lie Gly Ser His Thr Val Leu Gin Leu Leu 
15 10 15 

Leu Gin Gly Phe Arg Val Leu Val Val Asp Ser Leu Asp Asn Ala Ser 
20 25 30 

Glu Glu Ala lie Arg Arg Val Arg Gin Leu Ala Asn Ala Pro Gin Xaa 
35 40 45 

Ser Leu Asp Phe Arg Lys Val Asp Leu Arg Asp Lys Xaa Ala Leu Asp 
50 55 60 

Gin lie Phe Ser Ser Gin Arg Tyr Leu Xaa Leu Phe Ser Ala Lys Lys 
65 70 75 80 

Lys Tyr Leu Phe Ser Xaa Leu Leu Leu Xaa lie Asn Tyr Ser He Leu 
85 90 95 

Leu Ser Pro Gin He Lys Cys 
100 

<210> 25 

<211> 350 

<212> PRT 

<213> Pisum sativum 

<400> 25 

Met Val Ala Ser Ser Gin Lys He Leu Val Thr Gly Ser Ala Gly Phe 
15 10 15 

He Gly Thr His Thr Val Val Gin Leu Leu Asn Asn Gly Phe Asn Val 
20 25 30 

Ser He He Asp Asn Phe Asp Asn Ser Val Met Glu Ala Val Glu Arg 
35 40 45 

Val Arg Glu Val Val Gly Ser Asn Leu Ser Gin Asn Leu Glu Phe Thr 
50 55 60 

Leu Gly Asp Leu Arg Asn Lys Asp Asp Leu Glu Lys Leu Phe Ser Lys 
65 70 75 80 

Ser Lys Phe Asp Ala Val He His Phe Ala Gly Leu Lys Ala Val Gly 
85 90 95 

Glu Ser Val Glu Asn Pro Arg Arg Tyr Phe Asp Asn Asn Leu Val Gly 
100 105 110 

Thr He Asn Leu Tyr Glu Val Met Ala Lys His Asn Cys Lys Lys Met 
115 120 125 

Val Phe Ser Ser Ser Ala Thr Val Tyr Gly Gin Pro Glu Lys He Pro 
130 135 140 
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Cys Val Glu Asp Phe Lys Leu Gin Ala Met Asn Pro Tyr Gly Arg Thr 
145 150 155 160 

Lys Leu Phe Leu Glu Glu He Ala Arg Asp He Gin Lys Ala Glu Pro 
165 170 175 

Glu Trp Arg lie Val Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His 
180 185 190 

Glu Ser Gly Lys Leu Gly Glu Asp Pro Arg Gly lie Pro Asn Asn Leu 
195 200 205 

Met Pro Tyr He Gin Gin Val Ala Val Gly Arg Leu Pro Glu Leu Asn 
210 215 220 

Val Tyr Gly His Asp Tyr Pro Thr Arg Asp Gly Ser Ala He Arg Asp 
225 230 235 240 

Tyr lie His Val Met Asp Leu Ala Asp Gly His He Ala Ala Leu Arg 
245 250 255 

Lys Leu Phe Thr Ser Glu Asn He Gly Cys Thr Ala Tyr Asn Leu Gly 
260 265 270 

Thr Gly Arg Gly Ser Ser Val Leu Glu Met Val Ala Ala Phe Glu Lys 
275 280 285 

Ala Ser Gly Lys Lys He Ala Leu Lys Leu Cys Pro Arg Arg Pro Gly 
290 295 300 

Asp Ala Thr Glu Val Tyr Ala Ser Thr Ala Lys Ala Glu Lys Glu Leu 
305 310 315 320 

Gly Trp Lys Ala Lys Tyr Gly Val Glu Glu Met Cys Arg Asp Gin Trp 
325 330 335 

Asn Trp Ala Lys Asn Asn Pro Trp Gly Tyr Ser Gly Lys Pro 
340 345 350 

<210> 26 
. <211> 350 
<212> PRT 

<213> Cyamopsis tetragonoloba 
<400> 26 

Met Ser Ser Gin Thr Val Leu Val Thr Gly Gly Ala Gly Tyr He Gly 
15 10 15 

Ser His Thr Val Leu Gin Leu Leu Leu Gly Gly Phe Lys Ala Val Val 
20 25 30 

Val Asp Asn Leu Asp Asn Ser Ser Glu Thr Ala He His Arg Val Lys 
35 40 45 

Glu Leu Ala Gly Lys Phe Ala Gly Asn Leu Ser Phe His Lys Leu Asp 
50 55 60 

Leu Arg Asp Arg Asp Ala Leu Glu Lys He Phe Ser Ser Thr Lys Phe 
65 70 75 80 
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Asp Ser Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Val 
85 90 95 

Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Leu lie Gly Thr lie Val 
100 105 110 

Leu Phe Glu Val Met Ala Ala His Gly Cys Lys Lys Leu Val Phe Ser 
115 120 125 

Ser Ser Ala Thr Val Tyr Gly Leu Pro Lys Glu Val Pro Cys Thr Glu 
130 135 140 

Glu Phe Pro Leu Ser Ala Ala Asn Pro Tyr Gly Arg Thr Lys Leu lie 
145 150 155 160 

lie Glu Glu lie Cys Arg Asp lie Tyr Arg Ala Glu Gin Glu Trp Lys 
165 170 175 

lie lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly 
180 185 190 

Tyr lie Gly Glu Asp Pro Arg Gly He Pro Asn Asn Leu Met Pro Phe 
195 200 205 

Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu Thr Val Phe Gly 
210 215 220 

Asn Asp Tyr Thr Thr Ser Asp Gly Thr Gly Val Arg Asp Tyr He His 
225 230 235 240 

Val Val Asp Leu Ala Asp Gly His He Ala Ala Leu Arg Lys Leu Asn 
245 250 255 

Asp Pro Lys He Gly Cys Glu Val Tyr Asn Leu Gly Thr Gly Lys Gly 
260 265 270 

Thr Ser Val Leu Glu Met Val Lys Ala Phe Glu Gin Ala Ser Gly Lys 
275 280 285 

Lys He Pro Leu Val Met Ala Gly Arg Arg Pro Gly Asp Ala Glu Val 
290 295 300 

Val Tyr Ala Ser Thr Asn Lys Ala Glu Arg Glu Leu Asn Trp Lys Ala 
305 310 315 320 

Lys Tyr Gly He Asp Glu Met Cys Arg Asp Gin Trp Asn Trp Ala Ser 
325 330 335 

Lys Asn Pro Tyr Gly Tyr Gly Gly Ser Glu Asp Ser Ser Asn 
340 345 350 

<210> 27 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 



25 
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<400> 27 
catggaggag cag 

<210> 28 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Se 

<400> 28 
ctgctcctc 
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PCT/US00/03453 

13 

; oligonucleotide 

9 
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SEQUENCE LISTING 

<110> Carlson, Thomas J". 
Fader, Gary M. 
Famodu, Omolayo O. 
Kinney, Anthony J . 
Pearlstein, Richard W. 
Rafalski, J. Antoni 
Thorpe, Catherine J. 

<120> UDP-Glucose Modifiers 

<130> BB-1321-1 

<140> 09/913,064 
<141> 

<150> PCT/US00/03453 

<151> 2000-02-09 

<150> 60/119,588 
<151> 1999-02-10 

<160> 28 

<170> Microsoft Office 97 

<210> 1 
<211> 509 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (413) 

<223> n = A, C, G, or T 
<220> 

<221> unsure 
<222> (465) 

<223> n = A, C, G, or T 
<220> 

<221> unsure 
<222> (485) 

<223> n = A, C, G, or T 
<400> 1 

gattgatctc cgtgacaagg gagcactgga aatggttttt gcttctacaa gatttgaagc 60 
tgtcattcac ttcgctggat tgaaagctgt gggtgaaagc gtacagaagc cattacttta 120 
ttatgacaac aacgtcattg gcacgataaa tcttctagaa gttatgtctg ttcacggttg 180 
caagaagttg gtgttctcat catcagctgc agtttatgga tcacccaaaa actcaccctg 240 
cacagaaaat tttcctctta ctccaaacaa tccatatggc aaaacaaagc tcgttgttga 300 
agatatttgc cgggatatct accgttcaga tcctgaatgg aagatcattt tacttaggta 360 
cttcaatcca gttggtgctc atcctagtgg atatcttggc gaggacccac gangaattcc 42 0 
caacaatctt atgccctatg ttcagcaagt tgcggttggt aagangccag ctctaacagt 480 
tttangaaat gactatgcaa caagagatg 509 



<210> 2 
<211> 169 
<212> PRT 



<213> Zea mays 



<220> 

<221> UNSURE 
<222> (138) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (155) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (162) 

<22 3> Xaa = ANY AMINO ACID 



<400> 2 

lie Asp Leu Arg 
1 

Arg Phe Glu Ala 
20 

Ser Val Gin Lys 
35 

lie Asn Leu Leu 
50 

Phe Ser Ser Ser 
65 

Thr Glu Asn Phe 



Leu Val Val Glu 
100 

Trp Lys lie lie 
115 

Ser Gly Tyr Leu 
130 

Pro Tyr Val Gin 
145 



Asp Lys Gly Ala 
5 

Val lie His Phe 



Pro Leu Leu Tyr 
40 

Glu Val Met Ser 
55 

Ala Ala Val Tyr 
70 

Pro Leu Thr Pro 
85 

Asp He Cys Arg 



Leu Leu Arg Tyr 
120 

Gly Glu Asp Pro 
135 

Gin Val Ala Val 
150 



Leu Glu Met Val 
10 

Ala Gly Leu Lys 
25 

Tyr Asp Asn Asn 



Val His Gly Cys 
60 

Gly Ser Pro Lys 
75 

Asn Asn Pro Tyr 
90 

Asp He Tyr Arg 
105 

Phe Asn Pro Val 



Arg Xaa He Pro 
140 

Gly Lys Xaa Pro 
155 



Phe Ala Ser Thr 
15 

Ala Val Gly Glu 
30 

Val He Gly Thr 
45 

Lys Lys Leu Val 



Asn Ser Pro Cys 
80 

Gly Lys Thr Lys 
95 

Ser Asp Pro Glu 
110 

Gly Ala His Pro 
125 

Asn Asn Leu Met 



Ala Leu Thr Val 
160 



Leu Xaa Asn Asp Tyr Ala Thr Arg Asp 
165 



<210> 3 
<211> 456 
<212> DNA 

<213> Oryza sativa 



<220> 

<221> unsure 
<222> (322) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (352) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 

<222> (391) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 

<222> (406) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (447) 

<223> n = A, C, G, OR T 

<220> 

<221> unsure 
<222> (452) 

<223> n = A, C, G, OR T 



<400> 3 

atcactcttc 

gcacacaaca 

cgatcctggt 

aactcggctt 

tccgcgtcag 

tccgcgacaa 

aatttgccgg 

ctcatcggac 



ttcttccgct 
acaacaagag 
gacgggcggc 
ccgcgttgtc 
ggaactcgcc 
gcaagcgttg 
gctgaaaact 
ataaccacct 



ctctagcttt 
taatcaaagt 
gccggctaca 
gtcctcgaca 
ggacacaacg 
gnccaaatct 
gttggcgaaa 
gcagggnata 



gctttgcttg 
agaagaagat 
tcggcagcca 
acctcgacaa 
ccaacaacct 
tctcctctca 
ncgtgaaaaa 
gnggca 



cttcatcaaa 
ggtttcggcc 
caccgtcctc 
cgcctccgag 
cgacttccgc 
aaaggttgag 
cccctngttt 



ccccacacac 60 
ttgttgcgga 120 
cagcttctcc 180 
ctcgccatcc 240 
aagggtgacc 300 
gntgtcatcc 360 
tacgaaacaa 420 
456 



<210> 4 
<211> 99 
<212> PRT 

<213> Oryza sativa 



<220> 

<221> UNSURE 
<222> (69) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (79) 

<223> Xaa - ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (92) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (97) 

<22 3> Xaa = ANY AMINO ACID 
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Arg Thr lie Leu Val Thr Gly Gly 
1 5 

Val Leu Gin Leu Leu Gin Leu Gly 
20 

Leu Asp Asn Ala Ser Glu Leu Ala 
35 40 

Gly His Asn Ala Asn Asn Leu Asp 
50 55 

Lys Gin Ala Leu Xaa Gin He Phe 
65 70 

He Gin Phe Ala Gly Leu Lys Thr 
85 

Xaa Phe Tyr 



Ala Gly Tyr lie Gly Ser His Thr 
10 15 

Phe Arg Val Val Val Leu Asp Asn 
25 30 

lie Leu Arg Val Arg Glu Leu Ala 
45 

Phe Arg Lys Gly Asp Leu Arg Asp 
60 

Ser Ser Gin Lys Val Glu Xaa Val 
75 80 

Val Gly Glu Xaa Val Lys Asn Pro 
90 95 



<210> 5 

<211> 479 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (17) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (19) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (27) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (34) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (48) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (63) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (66) 

<223> n = A, C, G, OR T 
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<220> 

<221> unsure 
<222> (81) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (99) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (189) 

<223> n = A, C, G, OR T 



<220> 

<22 1> unsure 
<222> (225) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (360) 

<223> n - A, C, G, OR T 



<220> 

<221> unsure 
<222> (369) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (390) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (446) 

<223> n = A, C, G, OR T 



<400> 5 

agcaattctc 

ggnggnggcg 

gtttcaataa 

gtggttggnc 

gatgacttgg 

ttgaaaagcg 

tgggaccanc 

atcatctgca 



tccctcncna 
gtttcattgg 
tcgacaattt 
ctctgctttc 
agaaactctt 
gttgctgaaa 
aacctctacg 
accgtttatg 



atgcagnatg 
nacccacacc 
cgataactcc 
tcaaaacctc 
ctccaaaaca 
gcgttgcgaa 
agtttatggn 
ggcaanctga 



gggncttcct 
gtcgttcanc 
gtcatggaag 
caattcaccc 
acatttgatg 
accccgtcgc 
aaagtataat 
aaaaataccg 



cccaacanat 
ttctcaaagc 
caatggaccg 
aaggngatct 
ccgtgatcca 
tattttgatt 
tgcaaaaaga 
tgtgaggagg 



tctggtcacc 60 
tggcttcagc 120 
cgtccgccaa 180 
ccggaatagg 240 
ctttgctggc 300 
ttaatttggn 360 
tgggtttctc 420 
attcaagtt 479 



<210> 6 

<211> 148 

<212> PRT 

<213> Glycine max 



<220> 

<221> UNSURE 

<222> (3) 

<223> Xaa = ANY AMINO ACID 



<220> 
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<221> UNSURE 
<222> (7) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (24) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (111) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (114) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (121) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (140) 

<22 3> Xaa = ANY AMINO ACID 



<400> 6 

Met Gly Xaa Ser 
1 

lie Gly Thr His 
20 

Ser lie lie Asp 
35 

Val Arg Gin Val 
50 

Gin Gly Asp Leu 
65 

Thr Thr Phe Asp 



Glu Ser Val Ala 
100 

Thr Xaa Asn Leu 
115 

Gly Phe Ser Ser 
130 



Ser Gin Xaa lie 
5 

Thr Val Val Xaa 



Asn Phe Asp Asn 
40 

Val Gly Pro Leu 
55 

Arg Asn Arg Asp 
70 

Ala Val lie His 
85 

Lys Pro Arg Arg 



Tyr Glu Phe Met 
120 

Ser Ala Thr Val 
135 



Leu Val Thr Gly 
10 

Leu Leu Lys Ala 
25 

Ser Val Met Glu 



Leu Ser Gin Asn 
60 

Asp Leu Glu Lys 
75 

Phe Ala Gly Leu 
90 

Tyr Phe Asp Phe 
105 

Xaa Lys Tyr Asn 



Tyr Gly Gin Xaa 
140 



Gly Gly Gly Phe 
15 

Gly Phe Ser Val 
30 

Ala Met Asp Arg 
45 

Leu Gin Phe Thr 



Leu Phe Ser Lys 
80 

Lys Ala Val Ala 
95 

Asn Leu Xaa Gly 
110 

Cy s Ly s Ly s Me t 
125 

Glu Lys lie Pro 



Cys Glu Glu Asp 
145 



<210> 7 
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<211> 520 
<212> DNA 

<213> Triticum aestivum 



<220> 

<221> unsure 
<222> (498) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (508) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (513) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (518) 

<223> n = A # C, G, OR T 



<400> 7 

ggagcgtgct ggtgaccggc ggcgcggggt 
tggagaaggg ctacgccgtc accgccgtcg 
tcgaccgcgt ccgccacatc gtcggccccg 
gggacctgac gatcaaggat gacctggaga 
tgatacactt cgccgggctc aaggcggtgg 
accgcaacaa catcgtcggc accgtcaacc 
acaagttggt gttctcgtcg tcggcgaccg 
tcgaggactc ccccctcaag gccctcaacc 
gatctgcgcg actaccanca cgccgaancc 

<210> 8 
<211> 162 
<212> PRT 

<213> Triticum aestivum 



tcatcggcac gcacaccgtg ctgcagctgc 60 
acaacttcca caactccgtc cccgaggcgc 120 
ccctctccgc ccgcctccaa ttcatcttcg 180 
aggtcttcgc cgccaagaag tacgacgccg 240 
cggagagcgt ggcgcacccg gagatgtaca 3 00 
tctacgacgt catgaagaag cacgggtgca 360 
tgtacggcca gccggagaag gtgccctgct 42 0 
cgtacggcag gaccaagctg tactggagga 480 
ggngt ggngg 520 



<400> 8 

Val Leu Val Thr 
1 

Gin Leu Leu Glu 
20 

Asn Ser Val Pro 
35 

Ala Leu Ser Ala 
50 

Asp Asp Leu Glu 
65 

His Phe Ala Gly 



Met Tyr Asn Arg 
100 



Gly Gly Ala Gly 
5 

Lys Gly Tyr Ala 

Glu Ala Leu Asp 
40 

Arg Leu Gin Phe 
55 

Lys Val Phe Ala 
70 

Leu Lys Ala Val 
85 

Asn Asn He Val 



Phe lie Gly Thr 
10 

Val Thr Ala Val 
25 

Arg Val Arg His 



lie Phe Gly Asp 
60 

Ala Lys Lys Tyr 
75 

Ala Glu Ser Val 
90 

Gly Thr Val Asn 
105 



His Thr Val Leu 
15 

Asp Asn Phe His 
30 

lie Val Gly Pro 
45 

Leu Thr lie Lys 



Asp Ala Val lie 
80 

Ala His Pro Glu 
95 

Leu Tyr Asp Val 
110 
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Met Lys Lys His Gly Cys Asn Lys Leu Val Phe Ser Ser Ser Ala Thr 
115 120 125 

Val Tyr Gly Gin Pro Glu Lys Val Pro Cys Phe Glu Asp Ser Pro Leu 
130 135 140 

Lys Ala Leu Asn Pro Tyr Gly Arg Thr Lys Leu Tyr Trp Arg Arg Ser 
145 150 155 160 

Ala Arg 



<210> 9 
<211> 594 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (340) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (345) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (399) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (407) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (435) 

<223> n = A, C, G, OR T 
<220> 

<2 21> unsure 
<222> (460) 

<223> n = A, C, G, OR T 
<220> 

<2 21> unsure 
<222> (523) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (531) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (568) 



<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (587) 

<223> n = A # C, G, OR T 



<220> 

<221> unsure 
<222> (592) 

<223> n = A, C # G, OR T 



<400> 9 

ggacgccgtc atccacttcg ctgggctgaa ggccgtgggg gaaagcgtcg cgcacccgga 60 
gatgtactac gagaacaacc tcatcggcac catcaacctc tacaagagca tgaaggagca 120 
cggctgcaag aagctggttt tctcgtcatc cgccaccgtg tacggctggc cggaggtgat 180 
cccatgcgtc gaggactcca agctgcaggc cgccaaccca tacggcagga ccaagcttat 240 
ccttgaggat atggcgcgtg actaccaccg cgcggacacg gagtggagca tcgtcctgct 3 00 
gcgctacttc aaccccatcg gtgcgcacag ctccggcgan atcgngagag gaccccaagg 3 60 
ggataccgaa caacctgctg ccctacatcc agcaggtcnc cgtcggnagg ctccccgagc 420 
tcaacgtcta cgggncacga ttaccccacc cggggacggn accgcgatca gggactacat 480 
acacgtcgtc gaactcgccg atgggcacat cgcaagggct cangaactct ncgactctcc 540 
tgatataagt tgtgtgggct acaatctngg ggtacaaggg cggcggnaca tncg 5 94 



<210> 10 
<211> 197 
<212> PRT 
<213> Zea mays 



<220> 

<2 21> UNSURE 
<222> (113) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (115) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (133) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (136) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (145) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (174) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
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<222> (177) 

<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 

<222> (196) . . (197) 

<223> Xaa = ANY AMINO ACID 

<400> 10 

Asp Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Val 
15 10 15 

Ala His Pro Glu Met Tyr Tyr Glu Asn Asn Leu lie Gly Thr lie Asn 
20 25 30 

Leu Tyr Lys Ser Met Lys Glu His Gly Cys Lys Lys Leu Val Phe Ser 
35 40 45 

Ser Ser Ala Thr Val Tyr Gly Trp Pro Glu Val lie Pro Cys Val Glu 
50 55 60 

Asp Ser Lys Leu Gin Ala Ala Asn Pro Tyr Gly Arg Thr Lys Leu lie 
65 70 75 80 

Leu Glu Asp Met Ala Arg Asp Tyr His Arg Ala Asp Thr Glu Trp Ser 
85 90 95 

lie Val Leu Leu Arg Tyr Phe Asn Pro lie Gly Ala His Ser Ser Gly 
100 105 110 

Xaa lie Xaa Arg Gly Pro Gin Gly Asp Thr Glu Gin Pro Ala Ala Leu 
115 120 125 

His Pro Ala Gly Xaa Arg Arg Xaa Ala Pro Arg Ala Gin Arg Leu Arg 
130 135 140 

Xaa Thr lie Thr Pro Pro Gly Asp Gly Thr Ala lie Arg Asp Tyr lie 
145 150 155 160 

His Val Val Glu Leu Ala Asp Gly His lie Ala Arg Ala Xaa Glu Leu 
165 170 175 

Xaa Asp Ser Pro Asp lie Ser Cys Val Gly Tyr Asn Leu Gly Val Gin 
180 185 190 

Gly Arg Arg Xaa Xaa 
195 

<210> 11 
<211> 300 
<212> DNA 

<213> Oryza sativa 
<220> 

<221> unsure 
<222> (18) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (41) 
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<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (85) 

<223> n - A # C, G, OR T 
<220> 

<221> unsure 
<222> (154) 

<223> n = A, C # G, OR T 
<220> 

<221> unsure 
<222> (177) 

<223> n = A, C, G, OR T 
<220> 

<2 21> unsure 
<222> (180) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (184) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (202) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (209) 

<223> n = A, C, G, OR T 
<220> 

<2 21> unsure 
<222> (214) 

<223> n = A, C, G, OR T 
<220> 

<2 21> unsure 
<222> (227) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (247) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (252) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (283) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (285) 

<223> n = A, C, G, OR T 
<400> 11 

aggacttaaa agacaggnac aactggaata agtgttacgc ngccaagagg tatgacgccg 6 
tgatccactt cgccgggctg aagcngtggg ggagagcgtc gcgcaacccg cagatgtact 12 
acgaggacaa cgtcgccggc accatgaacc tctnctccgc cttgaccaag tacggcngcn 18 
agangatagt gttctcgtcg tnggcgacng tgtncggcca gccgganaag accccctgcg 24 
tcgaggnttc cnagctgagc gctctcaacc catacggcgc cancnggctc gtcctggaga 3 0 

<210> 12 
<211> 99 
<212> PRT 

<213> Oryza sativa 
<220> 

<221> UNSURE 
<222> (6) 

<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 
<222> (28) 

<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 
<222> (51) 

<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 

<222> (59) . . (61) 

<223> Xaa = ANY AMINO ACID 

<220> 

<221> UNSURE 
<222> (67) 

<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 
<222> (71) 

<22 3> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 
<222> (75) 

<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 
<222> (82) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (84) 
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<223> Xaa = ANY AMINO ACID 
<220> 

<221> UNSURE 

<222> (94) . . (95) 

<223> Xaa = ANY AMINO ACID 

<400> 12 

Asp Leu Lys Asp Arg Xaa Asn Trp Asn Lys Cys Tyr Ala Ala Lys Arg 
15 10 15 

Tyr Asp Ala Val lie His Phe Ala Gly Leu Lys Xaa Trp Gly Arg Ala 
20 25 30 

Ser Arg Asn Pro Gin Met Tyr Tyr Glu Asp Asn Val Ala Gly Thr Met 
35 40 45 

Asn Leu Xaa Ser Ala Leu Thr Lys Tyr Gly Xaa Xaa Xaa lie Val Phe 
50 55 60 

Ser Ser Xaa Ala Thr ; Val Xaa Gly Gin Pro Xaa Lys Thr Pro Cys Val 
65 70 75 80 

Glu Xaa Ser Xaa Leu Ser Ala Leu Asn Pro Tyr Gly Ala Xaa Xaa Leu 
85 90 95 

Val Leu Glu 



<210> 13 

<211> 1312 

<212> DNA 

<213> Glycine max 

v <400> 13 
gcacgagcca cttctctccc tctctattgc 
gtcaccggtg gtgccggttt cattggcacc 
ttcagcgttt caataatcga caatttcgat 
cgccaagtgg ttggccctct gctttctcag 
aatagggatg acttggagaa actcttctcc 
gctggcttga aagcggttgc tgaaagcgtt 
ttggttggca ccatcaacct ctacgagttt 
ttctcatcat ctgcaaccgt ttatggccaa 
aagttacaag ctatgaatcc ctatggacgg 
gatattcaga aagctgaacc agaatggaag 
ggggctcatg aaagtggcaa actcggtgaa 
ccttacattc agcaagtagc tgttggaaga 
tatccaacga gggatggctc tgcgatccgg 
ggccatattg ctgccctgcg aaagctcttc 
aacctgggaa ctggtcgtgg aacatctgtg 
tctggcaaga aaattccagt aaaattatgt 
tatgcatcta cagagagagc tgagaaagaa 
gagatgtgca gggaccaatg gaattgggca 
ccttgaatta gcttgagaaa tatactgctc 
tctcttatat agaatacttt tatgtttgat 
tgacaataaa aatttggcag catttcaaga 
taaattagac tggccattga tttgatattg 

<210> 14 
<211> 350 
<212> PRT 



agcatggtgt cttcctccca acacattctg 60 
cacaccgtcg ttcagcttct caaagctggc 120 
aactccgtca tggaagcagt ggaccgcgtc 180 
aacctccaat tcacccaggg cgatctccgg 240 
aaaacaacat ttgatgccgt gatccacttt 300 
gcgaagcccc gtcgctattt tgattttaat 360 
atggcaaagt ataattgcaa aaagatggtt 42 0 
cctgaaaaga taccgtgtga ggaggatttc 480 
accaagcttt tcctggaaga aattgcccga 540 
atcatattac tgagatactt caatccagtt 600 
gatcccaagg gcatcccaaa taacctcatg 660 
ttgactgaac tcaatgtata cggtcatgat 720 
gactatatcc atgtgatgga cttggcagat 780 
acaacggaga acataggttg tactgcttac 840 
cttgaaatgg ttacagcatt tgaaaaggct 900 
ccaagaagac cgggagatgc gactgaggtt 960 
cttggttgga aggcaaacta tggtgtggag 1020 
aagaacaatc cctggggtta cgcggggaag 1080 
atctacgaat gcttttcaca taaataggca 1140 
gatttgttta ggcagttcgt tgtataatct 1200 
agttaaagct atgtatttaa acaataactt 1260 
aaaaaaaaaa aaaaaaaaaa aa 1312 



<213> Glycine max 



<400> 14 

Met Val Ser Ser Ser Gin His lie Leu Val Thr Gly Gly Ala Gly Phe 
15 10 15 

lie Gly Thr His Thr Val Val Gin Leu Leu Lys Ala Gly Phe Ser Val 
20 25 30 

Ser lie lie Asp Asn Phe Asp Asn Ser Val Net Glu Ala Val Asp Arg 
35 40 45 

Val Arg Gin Val Val Gly Pro Leu Leu Ser Gin Asn Leu Gin Phe Thr 
50 55 60 

Gin Gly Asp Leu Arg Asn Arg Asp Asp Leu Glu Lys Leu Phe Ser Lys 
65 70 75 80 

Thr Thr Phe Asp Ala Val lie His Phe Ala Gly Leu Lys Ala Val Ala 
85 90 95 

Glu Ser Val Ala Lys Pro Arg Arg Tyr Phe Asp Phe Asn Leu Val Gly 
100 105 110 

Thr lie Asn Leu Tyr Glu Phe Met Ala Lys Tyr Asn Cys Lys Lys Met 
115 120 125 

Val Phe Ser Ser Ser Ala Thr Val Tyr Gly Gin Pro Glu Lys lie Pro 
130 135 140 

Cys Glu Glu Asp Phe Lys Leu Gin Ala Met Asn Pro Tyr Gly Arg Thr 
145 150 155 160 

Lys Leu Phe Leu Glu Glu lie Ala Arg Asp lie Gin Lys Ala Glu Pro 
165 170 175 

Glu Trp Lys lie lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His 
180 185 190 

Glu Ser Gly Lys Leu Gly Glu Asp Pro Lys Gly lie Pro Asn Asn Leu 
195 200 205 

Met Pro Tyr lie Gin Gin Val Ala Val Gly Arg Leu Thr Glu Leu Asn 
210 215 220 

Val Tyr Gly His Asp Tyr Pro Thr Arg Asp Gly Ser Ala lie Arg Asp 
225 230 235 240 

Tyr lie His Val Met Asp Leu Ala Asp Gly His lie Ala Ala Leu Arg 
245 250 255 

Lys Leu Phe Thr Thr Glu Asn He Gly Cys Thr Ala Tyr Asn Leu Gly 
260 265 270 

Thr Gly Arg Gly Thr Ser Val Leu Glu Met Val Thr Ala Phe Glu Lys 
275 280 285 

Ala Ser Gly Lys Lys He Pro Val Lys Leu Cys Pro Arg Arg Pro Gly 
290 295 300 



Asp Ala Thr Glu Val Tyr Ala Ser Thr Glu Arg Ala Glu Lys Glu Leu 
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305 



310 



315 



320 



Gly Trp Lys Ala Asn Tyr Gly Val Glu Glu Met Cys Arg Asp Gin Trp 
325 330 335 

Asn Trp Ala Lys Asn Asn Pro Trp Gly Tyr Ala Gly Lys Pro 
340 345 350 

<210> 15 
<211> 1516 
<212> DNA 

<213> Triticum aestivum 
<220> 

<221> unsure 

<222> (1458) . . (1459) 

<223> n = A, C, G, OR T 



<400> 15 

gcacgaggga 

cagctgctgg 

gaggcgctcg 

atcttcgggg 

gacgccgtga 

atgtacaacc 

gggtgcaaca 

ccctgcttcg 

ctggaggaga 

cgctacttca 

gtccccaaca 

aacgtctacg 

gtggtcgacc 

atcggctgtg 

agcgcgttcg 

ggcgattcgg 

gccaagtacg 

tatggctact 

ttgtaacgtg 

tgtttcctgt 

actgggccgg 

tgcttccttt 

aaaaaaaaaa 

agaaggaaag 

cgcgacggag 

ggggcctcaa 



gcgtgctggt 

agaagggcta 

accgcgtccg 

acctgacgat 

tacacttcgc 

gcaacaacat 

agttggtgtt 

aggactcccc 

tgctgcgcga 

accccatcgg 

acctgctccc 

gccacgacta 

tcgccgacgg 

tggcgtacaa 

agaaggcata 

agcaggtgta 

ggatcgagga 

gcggcaacgc 

aaagaaaaga 

tgtggtacta 

gctgttcctt 

tttttcaaat 

aaagaaaaaa 

agaaaaagaa 

aaagcgcnnt 

tccccg 



gaccggcggc 

cgccgtcacc 

ccacatcgtc 

caaggatgac 

cgggctcaag 

cgtcggcacc 

ctcgtcgtcg 

cctcaaggcc 

ctaccagcac 

cgcacacgag 

ctacatccag 

ccgcacccgc 

ccacatcgcg 

cctggggacg 

cggcaagaaa 

cgcgtccacc 

gatgtgcagg 

tgctgagaac 

tgtgtcaata 

ttcgtaagtt 

ggtgaagaat 

gatagagcga 

aaaaagacaa 

ggcgcaaagc 

ttcaggccgg 



gcggggttca 

gccgtcgaca 

ggccccgccc 

ctggagaagg 

gcggtggcgg 

gtcaacctct 

gcgaccgtgt 

ctcaacccgt 

gcgaacccgg 

agcggcgaca 

caggtggccg 

gacggcaccg 

gcgctcgaga 

gggcgcggga 

atcccggtga 

gccaaggccg 

gaccagtgga 

aaagactgat 

agcccagggc 

ggaacttgag 

ttggtctggt 

gaccgatgag 

aaaaaaaacg 

ggggccccgc 

gggggggggg 



tcggcacgca 
acttccacaa 
tctccgcccg 
tcttcgccgc 
agagcgtggc 
acgacgtcat 
acggccagcc 
acggcaggac 
agtggaggac 
tcggggagga 
tcgcccgccg 
ccgtcaggga 
agctcttcgc 
cgacggtgct 
agatgtgccc 
aagaggagct 
actgggccaa 
tcggtggccc 
attaaagtgt 
ttgggttaga 
ttcgaacatg 
gcaaaaaaaa 
agaaaggaga 
cgaacggacc 
ggaaccccgt 



caccgtgctg 
ctccgtcccc 
cctccaattc 
caagaagtac 
gcacccggag 
gaagaagcac 
ggagaaggtg 
caagctgtac 
gatcctgctg 
ccccaagggc 
ccccgagctc 
ctacatccac 
cacccctgac 
ggagatggtg 
caggaggccg 
cggctggagg 
gaagaacccg 
gtcgcgagcc 
gcccagaaaa 
ctggactgtc 
ggccgtcatc 
aaaaaaaaaa 
aaaaaatgac 
gacggcgcgc 
ttccctaagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1516 



<210> 16 
<211> 352 
<212> PRT 

<213> Triticum aestivum 



<400> 16 

Ala Arg Gly Ser Val Leu Val Thr Gly Gly Ala Gly Phe lie Gly Thr 
15 10 15 

His Thr Val Leu Gin Leu Leu Glu Lys Gly Tyr Ala Val Thr Ala Val 
20 25 30 

Asp Asn Phe His Asn Ser Val Pro Glu Ala Leu Asp Arg Val Arg His 
35 40 45 
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lie Val Gly Pro Ala Leu Ser Ala Arg Leu Gin Phe lie Phe Gly Asp 
50 55 60 

Leu Thr lie Lys Asp Asp Leu Glu Lys Val Phe Ala Ala Lys Lys Tyr 
65 70 75 80 

Asp Ala Val He His Phe Ala Gly Leu Lys Ala Val Ala Glu Ser Val 
85 90 95 

Ala His Pro Glu Met Tyr Asn Arg Asn Asn He Val Gly Thr Val Asn 
100 105 110 

Leu Tyr Asp Val Met Lys Lys His Gly Cys Asn Lys Leu Val Phe Ser 
115 120 125 

Ser Ser Ala Thr Val Tyr Gly Gin Pro Glu Lys Val Pro Cys Phe Glu 
130 135 140 

Asp Ser Pro Leu Lys Ala Leu Asn Pro Tyr Gly Arg Thr Lys Leu Tyr 
145 150 155 160 

Leu Glu Glu Met Leu Arg Asp Tyr Gin His Ala Asn Pro Glu Trp Arg 
165 170 175 

Thr He Leu Leu Arg Tyr Phe Asn Pro He Gly Ala His Glu Ser Gly 
180 185 190 

Asp He Gly Glu Asp Pro Lys Gly Val Pro Asn Asn Leu Leu Pro Tyr 
195 200 205 

He Gin Gin Val Ala Val Ala Arg Arg Pro Glu Leu Asn Val Tyr Gly 
210 215 220 

His Asp Tyr Arg Thr Arg Asp Gly Thr Ala Val Arg Asp Tyr He His 
225 230 235 240 

Val Val Asp Leu Ala Asp Gly His He Ala Ala Leu Glu Lys Leu Phe 
245 250 255 

Ala Thr Pro Asp He Gly Cys Val Ala Tyr Asn Leu Gly Thr Gly Arg 
260 265 270 

Gly Thr Thr Val Leu Glu Met Val Ser Ala Phe Glu Lys Ala Tyr Gly 
275 280 285 

Lys Lys He Pro Val Lys Met Cys Pro Arg Arg Pro Gly Asp Ser Glu 
290 295 300 

Gin Val Tyr Ala Ser Thr Ala Lys Ala Glu Glu Glu Leu Gly Trp Arg 
305 310 315 320 

Ala Lys Tyr Gly He Glu Glu Met Cys Arg Asp Gin Trp Asn Trp Ala 
325 330 335 

Lys Lys Asn Pro Tyr Gly Tyr Cys Gly Asn Ala Ala Glu Asn Lys Asp 
340 345 350 

<210> 17 
<211> 1393 
<212> DNA 
<213> Zea mays 



<400> 17 

gcacgaggat tgatctccgt gacaagggag cactggaaat ggtttttgct tctacaagat 60 
ttgaagctgt cattcacttc gctggattga aagctgtggg tgaaagcgta cagaagccat 12 0 
tactttatta tgacaacaac gtcattggca cgataaatct tctagaagtt atgtctgttc 180 
acggttgcaa gaagttggtg ttctcatcat cagctgcagt ttatggatca cccaaaaact 240 
caccctgcac agaaaatttt cctcttactc caaacaatcc atatggcaaa acaaagctcg 300 
ttgttgaaga tatttgccgg gatatctacc gttcagatcc tgaatggaag atcattttac 3 60 
ttaggtactt caatccagtt ggtgctcatc ctagtggata tcttggcgag gacccacgag 420 
gaattcccaa caatcttatg ccctatgttc agcaagttgc ggttggtagg aggccagctc 4 80 
taacagtttt aggaaatgac tatgcaacaa gagatgggac tggggtccga gattacatcc 54 0 
atgtggttga ccttgctgac ggacatattg ctgcattgca gaagcttttt gagaactcta 600 
gcatagggtg tgaagcgtac aaccttggaa ccggaagagg tacatctgtg ctggagattg 660 
ttaaagcatt tgagaaggct tctgggaaga aaatacctct gatttttggt gaaagacgcc 720 
caggtgatgc agagattctg ttttcagaga ctactaaagc agagagggag cttaactgga 7 80 
aagcaaaata cggtattgaa gagatgtgcc gcgaccaatg gaactgggcc agcaagaacc 840 
cttatggcta tggatcacct gactctatca agcagaatgg tcaccaaaca aacggatccg 900 
ctgactcctc caagcagaat ggccaccgca caaacggttc aactgactca cccaagcgga 960 
acggccacca tgcgtatggg tctgctgact cacccaagcg caacgggcac tgcgtttttg 1020 
gatcatcaga cctcaagccg aatggtaatg gccacctgcg ctgagcagaa ctgtttggcc 1080 
tgtgagctcc ctgtacattc ggttgcgatg tgagctccct gcacgttcgg tcgaggtcta 1140 
tcgtgaaccc actatccgag attgatgtgg atcattgggt tgacaggtca tacagtatag 12 00 
agccggtggc agaggaattc ctgtttgctg tgggtaaagc ttatcttctg ctttcgtgtt 12 60 
ttttcttgct tctttcgatt atggtgtagg aatgtggtca taatgtatta gctgattatc 1320 
ctttccctgc taattggact ttattacgct tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 
aaaaaaaaaa aaa 13 93 

<210> 18 
<211> 353 
<212> PRT 
<213> Zea mays 

<400> 18 

Thr Arg lie Asp Leu Arg Asp Lys Gly Ala Leu Glu Met Val Phe Ala 
15 10 15 

Ser Thr Arg Phe Glu Ala Val lie His Phe Ala Gly Leu Lys Ala Val 
20 25 30 

Gly Glu Ser Val Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Val lie 
35 40 45 

Gly Thr lie Asn Leu Leu Glu Val Met Ser Val His Gly Cys Lys Lys 
50 55 60 

Leu Val Phe Ser Ser Ser Ala Ala Val Tyr Gly Ser Pro Lys Asn Ser 
65 70 75 80 

Pro Cys Thr Glu Asn Phe Pro Leu Thr Pro Asn Asn Pro Tyr Gly Lys 
85 90 95 

Thr Lys Leu Val Val Glu Asp lie Cys Arg Asp He Tyr Arg Ser Asp 
100 105 110 

Pro Glu Trp Lys He He Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala 
115 120 125 

His Pro Ser Gly Tyr Leu Gly Glu Asp Pro Arg Gly He Pro Asn Asn 
130 135 140 



Leu Met Pro Tyr Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu 
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145 



150 155 160 



Thr Val Leu Gly Asn Asp Tyr Ala Thr Arg Asp Gly Thr Gly Val Arg 
165 170 175 

Asp Tyr lie His Val Val Asp Leu Ala Asp Gly His He Ala Ala Leu 
180 185 190 

Gin Lys Leu Phe Glu Asn Ser Ser He Gly Cys Glu Ala Tyr Asn Leu 
195 200 205 

Gly Thr Gly Arg Gly Thr Ser Val Leu Glu He Val Lys Ala Phe Glu 
210 215 220 

Lys Ala Ser Gly Lys Lys He Pro Leu He Phe Gly Glu Arg Arg Pro 
225 ^ 230 235 240 

Gly Asp Ala Glu He Leu Phe Ser Glu Thr Thr Lys Ala Glu Arg Glu 
245 250 255 

Leu Asn Trp Lys Ala Lys Tyr Gly He Glu Glu Met Cys Arg Asp Gin 
260 265 270 

Trp Asn Trp Ala Ser Lys Asn Pro Tyr Gly Tyr Gly Ser Pro Asp Ser 
275 280 285 

He Lys Gin Asn Gly His Gin Thr Asn Gly Ser Ala Asp Ser Ser Lys 
290 295 300 

Gin Asn Gly His Arg Thr Asn Gly Ser Thr Asp Ser Pro Lys Arg Asn 
305 310 315 320 

Gly His His Ala Tyr Gly Ser Ala Asp Ser Pro Lys Arg Asn Gly His 
325 330 335 

Cys Val Phe Gly Ser Ser Asp Leu Lys Pro Asn Gly Asn Gly His Leu 
340 345 350 



Arg 



<210> 19 
<211> 1498 
<212> DNA 

<213> Oryza sativa 
<400> 19 

gcacgagatc actcttcttc ttccgctctc 
cacacacgca cacaacaaca acaagagtaa 
ttgcggacga tcctggtgac gggcggcgcc 
cttctccaac tcggcttccg cgttgtcgtc 
gccatcctcc gcgtcaggga actcgccgga 
gttgacctcc gcgacaagca agcgttggac 
gtcatccatt ttgccgggct gaaagctgtt 
tacgacaaca acc teat egg caccatcact 
accaagctgg tgttctcatc atccgcaact 
actgaagaat ccccactttg tgcaatgaac 
gacatgtgcc gggatctgea tgcctcagac 
ttcaaccctg ttggagctca cccaagcggg 
aacaacctca tgcccttcgt ccagcaggtc 
tatggaaccg actacaacac caaggatgga 



tagctttget ttgettgett catcaaaccc 60 

tcaaagtaga agaagatggt ttcggccttg 12 0 

ggctacatcg gcagccacac cgtcctccag 180 

ctcgacaacc tcgacaacgc ctccgagctc 240 

cacaacgcca acaacctcga cttccgcaag 300 

caaatcttct cctctcaaag gtttgaggct 360 

ggcgagagcg tgeagaagee cctgctttac 420 

ctcctgcagg tcatggccgc acatggctgc 480 

gtctacgggt ggcccaagga ggtgccctgc 540 

ccctacggca gaacaaagct ggtaatcgaa 600 

ccaaactgga agatcatact gctccgatac 660 

tacattggtg aggacccctg cggcatccca 720 

gctgttggca ggaggcegge ccttaccgtc 780 

actggggttc gtgactatat ccatgttgtt 840 



gatctagcgg atggtcatat cgccgcgtta aggaagctct atgaagattc tgatagaata 900 
ggatgtgagg tgtacaatct gggcactgga aaggggacat ctgtgctgga aatggttgca 960 
gcattcgaga aagcttctgg aaagaaaatc ccgcttgtat ttgctggacg aaggcctgga 1020 
gatgccgaga tcgtttacgc tcaaactgcc aaagctgaga aggaactgaa atggaaggca 1080 
aaatacgggg tagaggagat gtgcagggac ctgtggaatt gggcgagcaa gaacccctac 114 0 
gggtatggat cgccggacag tagcaactga tccagctgaa tataggcgtc caatcctcca 1200 
gtagcagcag cagcagcatg acttctatac atatatatat ataatcataa agaatgaaga 1260 
aacaaagaat tcggacttgt tgagttacta ctactactac tactaatccc atctgatgga 1320 
ccgcattgta tagggggctt gtaggggtcc agcagcttca tcatcagtct ccttaggagg 13 80 
cctctaatat aatctccata tttatggtag aaataaattt tgcccaccgt ggaagaacta 1440 
tataatagaa tcatgatgat ttgttgatta aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1498 

<210> 20 
<211> 354 
<212> PRT 

<213> Oryza sativa 
<400> 20 

Met Val Ser Ala Leu Leu Arg Thr lie Leu Val Thr Gly Gly Ala Gly 
15 10 15 

Tyr lie Gly Ser His Thr Val Leu Gin Leu Leu Gin Leu Gly Phe Arg 
20 25 30 

Val Val Val Leu Asp Asn Leu Asp Asn Ala Ser Glu Leu Ala lie Leu 
35 40 45 

Arg Val Arg Glu Leu Ala Gly His Asn Ala Asn Asn Leu Asp Phe Arg 
50 55 60 

Lys Val Asp Leu Arg Asp Lys Gin Ala Leu Asp Gin lie Phe Ser Ser 
65 70 75 80 

Gin Arg Phe Glu Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly 
85 90 95 

Glu Ser Val Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Leu lie Gly 
100 105 110 

Thr He Thr Leu Leu Gin Val Met Ala Ala His Gly Cys Thr Lys Leu 
115 120 125 

Val Phe Ser Ser Ser Ala Thr Val Tyr Gly Trp Pro Lys Glu Val Pro 
130 135 140 

Cys Thr Glu Glu Ser Pro Leu Cys Ala Met Asn Pro Tyr Gly Arg Thr 
145 150 155 160 

Lys Leu Val He Glu Asp Met Cys Arg Asp Leu His Ala Ser Asp Pro 
165 170 175 

Asn Trp Lys He He Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His 
180 185 190 

Pro Ser Gly Tyr He Gly Glu Asp Pro Cys Gly He Pro Asn Asn Leu 
195 200 205 

Met Pro Phe Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu Thr 
210 215 220 



Val Tyr Gly Thr Asp Tyr Asn Thr Lys Asp Gly Thr Gly Val Arg Asp 
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230 235 240 



Tyr lie His Val Val Asp Leu Ala Asp Gly His lie Ala Ala Leu Arg 
245 250 255 

Lys Leu Tyr Glu Asp Ser Asp Arg lie Gly Cys Glu Val Tyr Asn Leu 
260 265 270 

Gly Thr Gly Lys Gly Thr Ser Val Leu Glu Met Val Ala Ala Phe Glu 
275 280 285 

Lys Ala Ser Gly Lys Lys lie Pro Leu Val Phe Ala Gly Arg Arg Pro 
290 295 300 

Gly Asp Ala Glu lie Val Tyr Ala Gin Thr Ala Lys Ala Glu Lys Glu 
305 310 315 320 

Leu Lys Trp Lys Ala Lys Tyr Gly Val Glu Glu Met Cys Arg Asp Leu 
325 330 335 

Trp Asn Trp Ala Ser Lys Asn Pro Tyr Gly Tyr Gly Ser Pro Asp Ser 
340 345 350 



Ser Asn 



<210> 21 

<211> 1532 

<212> DNA 

<213> Glycine max 



<400> 21 

gaattcggca cgagcgcaaa ctttcttcca 
ccgaatatgg catcgcgcgt cagcattggc 
tcccctcact ttcgctcacc acttaagatt 
cataaggtac ttatgcgcga taagactgta 
agccacaccg ttcttcagct cttgctcgga 
gaaaattcct ccgaggttgc catccacaga 
aacctctcct ttcacaaggt ggacctacgg 
tccacacaat tcgatgctgt catacatttt 
caaaaacctt tactatacta taacaacaac 
atggctgccc atggatgcaa gaagctcgtg 
ccaaaggagg ttccatgcac agaagagttc 
actaagctta tcattgaaga aatttgccgt 
ataattttgt taagatactt caacccagtt 
gatcctcgtg gaattccaaa caatctcatg 
cggcctgcac tgacagtttt tggaaatgat 
gattacattc atgttgttga tttagcagat 
gaacctaata taggttgtga ggtttataac 
gagatggtta gagcttttga aatggcatct 
cgtagacctg gtgatgctga aattgtttat 
aaatggaagg caaaatatgg cattgatgag 
aaaaaccctt atggctatgg agatcagggc 
gggttctttt ctgaactcac ccacaccgta 
aaaagctttt ctttctttag tgatcttaag 
tagttacata gtaagtaagt agtggtttac 
cttcttacag gttattgatt attagattcg 
ctgttttgta aatctaaatc acatgtttgt 



aacgaacgtg tcacaaaatt ctcgccttct 60 
aaccttacct cctccgcgcc gtatattaat 120 
tccaacaacc cctctctgca aaacgcttcg 180 
ctggtaaccg gcggagccgg ttacatcggc 24 0 
ggtttcagag ccgtcgtcct cgacaacctc 300 
gtcagggagc tcgccggcga atttgggaac 360 
gacagagctg ctctagacca aatattttct 420 
gctggactga aagcagtagg agaaagtgtg 480 
ttgactggga caatcactct attggaagtc 540 
ttttcatctt cagcaactgt atatggttgg 600 
cctctgtcag caatgaaccc atatggacga 660 
gatgtccact gtgcagagcc agattgtaaa 720 
ggtgcacacc ccagtggtta tattggggag 780 
ccatttgttc agcaagtagc agttggccga 84 0 
tataatacaa gtgatggcac tggggttcgg 900 
gggcacattg ctgcattgct taaactagat 960 
ctgggaacag gaaagggaac atcagttttg 102 0 
ggaaagaaaa ttccacttgt gatggctggc 1080 
gcatcaacaa agaaagcgga aagagagctt 1140 
atgtgccgtg atcaatggaa ttgggctagc 12 00 
tccaccgatt aaccacttag ttttctcttt 1260 
gtccgtaggt cttgtgaatt tagttttccc 1320 
gtgacaaagt acttgtatta ttactattca 13 80 
tatactgtaa tttaaaggtt ctctaggttc 1440 
gattctctca tgttccacat gagcagcatc 1500 
tt 1532 



<210> 22 
<211> 349 



<212> PRT 

<213> Glycine max 



<400> 22 

Met Arg Asp Lys Thr Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly 
15 10 15 

Ser His Thr Val Leu Gin Leu Leu Leu Gly Gly Phe Arg Ala Val Val 
20 25 30 

Leu Asp Asn Leu Glu Asn Ser Ser Glu Val Ala lie His Arg Val Arg 
35 40 45 

Glu Leu Ala Gly Glu Phe Gly Asn Asn Leu Ser Phe His Lys Val Asp 
50 55 60 

Leu Arg Asp Arg Ala Ala Leu Asp Gin lie Phe Ser Ser Thr Gin Phe 
65 70 75 80 

Asp Ala Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Val 
85 90 95 

Gin Lys Pro Leu Leu Tyr Tyr Asn Asn Asn Leu Thr Gly Thr lie Thr 
100 105 110 

Leu Leu Glu Val Met Ala Ala His Gly Cys Lys Lys Leu Val Phe Ser 
115 120 125 

Ser Ser Ala Thr Val Tyr Gly Trp Pro Lys Glu Val Pro Cys Thr Glu 
130 135 140 

Glu Phe Pro Leu Ser Ala Met Asn Pro Tyr Gly Arg Thr Lys Leu lie 
145 150 155 160 

lie Glu Glu lie Cys Arg Asp Val His Cys Ala Glu Pro Asp Cys Lys 
165 170 175 

lie lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly 
180 185 190 

Tyr lie Gly Glu Asp Pro Arg Gly lie Pro Asn Asn Leu Met Pro Phe 
195 200 205 

Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu Thr Val Phe Gly 
210 215 220 

Asn Asp Tyr Asn Thr Ser Asp Gly Thr Gly Val Arg Asp Tyr lie His 
225 230 235 240 

Val Val Asp Leu Ala Asp Gly His lie Ala Ala Leu Leu Lys Leu Asp 
245 250 255 

Glu Pro Asn lie Gly Cys Glu Val Tyr Asn Leu Gly Thr Gly Lys Gly 
260 265 270 

Thr Ser Val Leu Glu Met Val Arg Ala Phe Glu Met Ala Ser Gly Lys 
275 280 285 



Lys lie Pro Leu Val Met Ala Gly Arg Arg Pro Gly Asp Ala Glu lie 
290 295 300 



n. ji n :& ,Jl „:dj 'C3 ""i" 1.1 a> sui* a. n,Jfi.ci, 



Val Tyr Ala Ser Thr Lys Lys Ala Glu Arg Glu Leu Lys Trp Lys Ala 
305 310 315 320 

Lys Tyr Gly He Asp Glu Met Cys Arg Asp Gin Trp Asn Trp Ala Ser 
325 330 335 

Lys Asn Pro Tyr Gly Tyr Gly Asp Gin Gly Ser Thr Asp 
340 345 

<210> 23 
<211> 490 
<212> DNA 

<213> Triticum aestivum 
<220> 

<221> unsure 
<222> (61) 

<223> n - A, C, G, OR T 
<220> 

<221> unsure 
<222> (73) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (81) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (207) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (246) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (284) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (319) 

<223> n - A, C, G, OR T 
<220> 

<221> unsure 
<222> (332) 

<223> n = A, C, G, OR T 
<220> 

<221> unsure 
<222> (378) . . (379) 
<223>n=A, C, G, ORT 



<220> 

<221> unsure 



^-.n „.3 -*l.h '£3 ii. iuj o ulj Jin* n:z, 



<222> (409) 

<223> n = A, C, G, OR T 
<220> 

<2 21> unsure 
<222> (413) 

<223>n==A, C, G, ORT 



<220> 

<221> unsure 
<222> (418) 
<223> n = A, 

<220> 

<221> unsure 
<222> (455) 
<223> n = A, 

<220> 

<221> unsure 

<222> (461) 

<223> n = A, 



C, G, OR T 



C, G , OR T 



C, G, OR T 



<220> 

<221> unsure 
<222> (468) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (480) 

<223> n = A, C, G, OR T 



<220> 

<221> unsure 
<222> (482) 

<223> n = A, C, G, OR T 



<400> 23 

aagaaacaag 

ntgaccggcg 

ttccgcgtcc 

cgacaactcg 

aaggangcgc 

aagaagtatc 

caaattaaat 

tgctgctata 

cncctacgag 



agagcaagga 
gcncggggta 
tcgtagtcga 
ccaacgcccc 
tcgaccaaat 
ttttttcgng 
gttgattnnt 
gcaaatacga 



agaagaagat 
natcggcagc 
cagcctcgac 
gcaaaanagc 
cttctcctcc 
cttattatta 
ccgtccgtcc 
ctcctttcta 



ggtgtctgcg 
cacaccgtgc 
aacgcctccg 
ctcgacttcc 
caaaggtatc 
anaattaact 
cggccgtcgt 
tcagnatcgt 



gtgttgagga 
tgcagctgct 
aggaggccat 
gcaaggtgga 
ttcnactttt 
atagtatatt 
gccagccanc 
ngtcgttngt 



cgattcctgg 
cctgcagggc 
ccgccgcgtc 
ccttcgtgac 
ttccgcaaaa 
attgagtcca 
canccgtntc 
aggtgtcaan 



60 
120 
180 
240 
300 
360 
420 
480 
490 



<210> 24 
<211> 103 
<212> PRT 

<213> Triticum aestivum 



<220> 

<221> UNSURE 
<222> (4) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
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<222> (6) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (48) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (61) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (74) 

<22 3> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 
<222> (86) 

<223> Xaa = ANY AMINO ACID 



<220> 

<221> UNSURE 

<222> (90) 

<223> Xaa = ANY AMINO ACID 



<400> 24 

Thr Gly Gly Xaa Gly Xaa lie Gly 
1 5 

Leu Gin Gly Phe Arg Val Leu Val 
20 

Glu Glu Ala lie Arg Arg Val Arg 
35 40 

Ser Leu Asp Phe Arg Lys Val Asp 
50 55 

Gin lie Phe Ser Ser Gin Arg Tyr 
65 70 

Lys Tyr Leu Phe Ser Xaa Leu Leu 
85 



Ser His Thr Val Leu Gin Leu Leu 
10 15 

Val Asp Ser Leu Asp Asn Ala Ser 
25 30 

Gin Leu Ala Asn Ala Pro Gin Xaa 
45 

Leu Arg Asp Lys Xaa Ala Leu Asp 
60 

Leu Xaa Leu Phe Ser Ala Lys Lys 
75 80 

Leu Xaa lie Asn Tyr Ser lie Leu 
90 95 



Leu Ser Pro Gin lie Lys Cys 
100 



<210> 25 
<211> 350 
<212> PRT' 

<213> Pisum sativum 



<400> 25 

Met Val Ala Ser Ser Gin Lys lie Leu Val Thr Gly Ser Ala Gly Phe 
15 10 15 

lie Gly Thr His Thr Val Val Gin Leu Leu Asn Asn Gly Phe Asn Val 
20 25 30 



iLu J^ii aus " **r 



if ifiT"a *i~ !■ ~ J '11 

-if -iLji n^u in;:: 



Ser lie lie Asp 
35 

Val Arg Glu Val 
50 

Leu Gly Asp Leu 
65 

Ser Lys Phe Asp 



Glu Ser Val Glu 
100 

Thr lie Asn Leu 
115 

Val Phe Ser Ser 
130 

Cys Val Glu Asp 
145 

Lys Leu Phe Leu 



Glu Trp Arg lie 
180 

Glu Ser Gly Lys 
195 

Met Pro Tyr lie 
210 

Val Tyr Gly His 
225 

Tyr lie His Val 



Lys Leu Phe Thr 
260 

Thr Gly Arg Gly 
275 

Ala Ser Gly Lys 
290 

Asp Ala Thr Glu 
305 

Gly Trp Lys Ala 



Asn Trp Ala Lys 
340 



Asn Phe Asp Asn 
40 

Val Gly Ser Asn 
55 

Arg Asn Lys Asp 
70 

Ala Val lie His 
85 

Asn Pro Arg Arg 



Tyr Glu Val Met 
120 

Ser Ala Thr Val 
135 

Phe Lys Leu Gin 
150 

Glu Glu lie Ala 
165 

Val Leu Leu Arg 



Leu Gly Glu Asp 
200 

Gin Gin Val Ala 
215 

Asp Tyr Pro Thr 
230 

Met Asp Leu Ala 
245 

Ser Glu Asn lie 



Ser Ser Val Leu 
280 

Lys lie Ala Leu 
295 

Val Tyr Ala Ser 
310 

Lys Tyr Gly Val 
325 

Asn Asn Pro Trp 



Ser Val Met Glu 



Leu Ser Gin Asn 
60 

Asp Leu Glu Lys 
75 

Phe Ala Gly Leu 
90 

Tyr Phe Asp Asn 
105 

Ala Lys His Asn 



Tyr Gly Gin Pro 
140 

Ala Met Asn Pro 
155 

Arg Asp lie Gin 
170 

Tyr Phe Asn Pro 
185 

Pro Arg Gly lie 



Val Gly Arg Leu 
220 

Arg Asp Gly Ser 
235 

Asp Gly His lie 
250 

Gly Cys Thr Ala 
265 

Glu Met Val Ala 



Lys Leu Cys Pro 
300 

Thr Ala Lys Ala 
315 

Glu Glu Met Cys 
330 

Gly Tyr Ser Gly 
345 



Ala Val Glu Arg 
45 

Leu Glu Phe Thr 



Leu Phe Ser Lys 
80 

Lys Ala Val Gly 
95 

Asn Leu Val Gly 
110 

Cys Lys Lys Met 
125 

Glu Lys lie Pro 



Tyr Gly Arg Thr 
160 

Lys Ala Glu Pro 
175 

Val Gly Ala His 
190 

Pro Asn Asn Leu 
205 

Pro Glu Leu Asn 



Ala lie Arg Asp 
240 

Ala Ala Leu Arg 
255 

Tyr Asn Leu Gly 
270 

Ala Phe Glu Lys 
285 

Arg Arg Pro Gly 



Glu Lys Glu Leu 
320 

Arg Asp Gin Trp 
335 

Lys Pro 
350 



<211> 350 
<212> PRT 

<213> Cyamopsis tetragonoloba 
<400> 26 

Met Ser Ser Gin Thr Val Leu Val Thr Gly Gly Ala Gly Tyr lie Gly 
15 10 15 

Ser His Thr Val Leu Gin Leu Leu Leu Gly Gly Phe Lys Ala Val Val 
20 25 30 

Val Asp Asn Leu Asp Asn Ser Ser Glu Thr Ala lie His Arg Val Lys 
35 40 45 

Glu Leu Ala Gly Lys Phe Ala Gly Asn Leu Ser Phe His Lys Leu Asp 
50 55 60 

Leu Arg Asp Arg Asp Ala Leu Glu Lys lie Phe Ser Ser Thr Lys Phe 
65 70 75 80 

Asp Ser Val lie His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Val 
85 90 95 

Gin Lys Pro Leu Leu Tyr Tyr Asp Asn Asn Leu lie Gly Thr lie Val 
100 105 110 

Leu Phe Glu Val Met Ala Ala His Gly Cys Lys Lys Leu Val Phe Ser 
115 120 125 

Ser Ser Ala Thr Val Tyr Gly Leu Pro Lys Glu Val Pro Cys Thr Glu 
130 135 140 

Glu Phe Pro Leu Ser Ala Ala Asn Pro Tyr Gly Arg Thr Lys Leu lie 
145 150 155 160 

lie Glu Glu lie Cys Arg Asp lie Tyr Arg Ala Glu Gin Glu Trp Lys 
165 . 170 175 

lie lie Leu Leu Arg Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly 
180 185 190 

Tyr lie Gly Glu Asp Pro Arg Gly lie Pro Asn Asn Leu Met Pro Phe 
195 200 205 

Val Gin Gin Val Ala Val Gly Arg Arg Pro Ala Leu Thr Val Phe Gly 
210 215 220 

Asn Asp Tyr Thr Thr Ser Asp Gly Thr Gly Val Arg Asp Tyr lie His 
225 230 235 240 

Val Val Asp Leu Ala Asp Gly His lie Ala Ala Leu Arg Lys Leu Asn 
245 250 255 

Asp Pro Lys lie Gly Cys Glu Val Tyr Asn Leu Gly Thr Gly Lys Gly 
260 265 270 

Thr Ser Val Leu Glu Met Val Lys Ala Phe Glu Gin Ala Ser Gly Lys 
275 280 285 



Lys lie Pro Leu Val Met Ala Gly Arg Arg Pro Gly Asp Ala Glu Val 



Val Tyr Ala Ser Thr Asn Lys Ala Glu Arg Glu Leu Asn Trp Lys Ala 
305 310 315 320 



Lys Tyr Gly lie Asp Glu Met Cys Arg Asp Gin Trp Asn Trp Ala Ser 
325 330 335 

Lys Asn Pro Tyr Gly Tyr Gly Gly Ser Glu Asp Ser Ser Asn 
340 345 350 

<210> 27 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 27 
catggaggag cag 

<210> 28 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 28 
ctgctcctc 



